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^ . Abstract 

Q^l We review attempts to apply the variational principle to under- 

Q^' stand the vacuum of non-abelian gauge theories. In particular, we 

D . focus on the method explored by Ian Kogan and collaborators, which 

'"^ ' imposes exact gauge invariance on the trial Gaussian wave functional 

prior to the minimization of energy. We describe the application of the 
^ . method to a toy model — confining compact QED in 2+1 dimensions 

H ' — where it works wonderfully and reproduces all known non-trivial 

results. We then follow its applications to pure Yang-Mills theory in 
3+1 dimensions at zero and finite temperature. Among the results 
of the variational calculation are dynamical mass generation and the 
analytic description of the deconfinement phase transition. 
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1 Introduction 

Quantum Chroinodynainics (QCD) was formulated more than 30 years ago. 
There is very httle doubt that it is, indeed, the correct theory of strong 
interactions. An impressive amount of experimental data is successfully de- 
scribed by QCD calculations in the high momentum transfer domain. For 
these processes it is the short time - short distance dynamics that is rele- 
vant. Fortunately, in this domain QCD is weakly coupled and thus precise 
quantitative information can be obtained by analytic calculations. 

Of course, high momentum transfer processes are not the only, and per- 
haps not the most interesting, domain governed by QCD dynamics. The 
vacuum — low energy, large distance — sector of the strong interactions 
exhibits a wealth of phenomena which are both qualitatively striking at the 
experimental level and notoriously difficult to establish within the fundamen- 
tal framework of QCD. Understanding low energy phenomena in QCD, such 
as confinement and chiral symmetry breaking or, in more general terms, the 
strong coupling problem and the ground state structure of an asymptotically 
free non-abelian gauge theory is, without doubt, one of the main problems 
in the modern quantum field theory. In spite of years of attempts to answer 
these questions we are still far from achieving this goal. 

Many routes have been tried in approaching this problem. They range 
from the highly computer intensive numerical programme of the lattice gauge 
theory |lj , through the universality based concepts of effective field theories 
[2] to the monopole and vortex inspired searches for the confinement mech- 
anism [21111. In recent years, there have also been attempts to approach the 
non-perturbative physics of QCD utilizing the information about supersym- 
metric gauge theories jH]. 

While all these ideas are interesting and productive, each has its own 
drawbacks. The lattice gauge theory aims, in principle, at producing the 
complete set of numbers that characterize the QCD spectrum, condensates, 
various matrix elements, etc. This goal is, however, still not within reach. 
Also often one would like to understand the underlying physics rather than 
just calculate a given number albeit with an accuracy of a fraction of a 
percent, and this is difficult within a numerical approach. The effective field 
theory and dual superconductor approaches are based on simple physical 
pictures, but have their starting points rather far from actual QCD, so that 
making quantitative contact with QCD dynamics is very difficult. The SUSY 
motivated route suffers from the same basic problem, as it is not in principle 



clear how much the QCD dynamics is distorted by supersymmetry. 

One should therefore welcome any method which attempts to analytically 
obtain dynamical information directly from QCD, even if this information 
may be partial and incomplete. Such a method should be, of course, in- 
trinsically non-perturbative. Unfortunately, the arsenal of non-perturbative 
methods to tackle strongly interacting quantum field theories (QFT) is, to 
say the least, very limited. Methods that perform very well in simple quan- 
tum mechanical problems are much more difficult to use in QFT. This is true, 
for example, for a variational approach. In quantum mechanics it is usually 
enough to know a few simple qualitative features in order to set up a varia- 
tional Ansatz which gives pretty accurate results, not only for the energy of 
a ground state, but also for various other vacuum expectation values (VEV). 
In QFT one is immediately faced with several difficult problems when trying 
to apply this method, as discussed insightfully by Feynman ^. 

Nevertheless over the years there have been several attempts to apply dif- 
ferent versions of the variational Rayleigh-Ritz method to QCD. The purpose 
of this article is to review the results of this approach. We will concentrate 
most of our attention on the recent incarnation of the variational method pi- 
oneered by Ian Kogan with collaborators ^14j, which in terms of applications 
and results made much more headway than any of the previous attempts. 
This is not to say that it is free from problems and immune to criticism. 
Still, we believe that this is a good time to summarize its status for two 
reasons. First because some interesting quantitative results have been al- 
ready achieved; and, second, because there is plenty of room for improving 
the method, and some aspects can be improved with relative ease, so that 
clearly the method has not yet outlived its usefulness. 

As mentioned earlier, when applying a variational method to QFT one 
is faced with many difficulties. First of all there is a problem of generality 
of a trial state. The trial state ought to be general enough to allow for, 
through variation of its parameters, the relevant physics to be spanned. In 
quantum mechanics the task can be put down simply to identifying a few 
critical physical properties and consequently writing a compliant trial state. 
On the other hand, the Hilbert space of QFT is enormous, and it is much 
more difficult to identify "by pure thought" the relevant characteristics that 
have to be probed. 

Then there is the problem of calculability. That is, even if one had a very 
good guess at the form of the vacuum wave functional (or, for that matter, 
even knew its exact form) one would still have to evaluate expectation values 



of various operators in this state: 

(o)=y"D0vi/*[0]OM/[0]. (1) 

A calculation of this kind is, obviously, tantamount to the evaluation of a 
Euclidean functional integral with the square of the wave functional (WF) 
playing the role of the partition function. One should therefore be able to 
solve exactly a d-dimensional field theory with the action 

5[0] = -logM/*[0]M/[0]. (2) 

In quantum mechanics such a concern plays only a background role. What- 
ever the chosen trial state might be, the calculation to be performed involves 
integrals of functions. The evaluation of any such integral can be tackled, if 
not always analytically then numerically, without major complications. In 
QFT, where our ability to evaluate path integrals is, to say the least, lim- 
ited, the calculability restriction on the trial wave functional is very severe. 
Since in dimension d > 1 the only theories one can solve exactly are free 
field theories, the requirement of calculability almost unavoidably restricts 
the possible form of the trial WF to a Gaussian state: 

^[0] = exp L^jd'xd'y [<P{x) - C(x)] G-\x,y) [0(y) - C{y)]\ (3) 

with ({x) and G{x,y) being c-number functions. 

Another serious problem is that of "ultraviolet modes". The main moti- 
vation of a variational calculation in a strongly interacting theory is to learn 
about the distribution of the low momentum modes of the field in the vacuum 
wave functional. However, the VEV of the energy (and all other intensive 
quantities) is dominated entirely by contributions of high momentum fluctu- 
ations, for the simple reason that there are inflnitely more ultraviolet modes 
than modes with low momentum. Therefore, even if one has a very good idea 
of how the WF at low momenta should look like, if the ultraviolet part of the 
trial state is even slightly incorrect the minimization of energy may lead to 
absurd results. Due to the interaction between the high and low momentum 
modes, there is a good chance that the infrared (IR) variational parameters 
will be driven to values which minimize the interaction energy, and have 
nothing to do with the dynamics of the low momentum modes themselves. 



Finally, in gauge theories there is an additional complication: that of 
gauge invariance. The allowed wave functions must be invariant under the 
time independent gauge transformations. If one does not impose the Gauss' 
law on the states exactly, one is not solving the right problem. The QCD 
Hamiltonian is only defined on the gauge invariant states, and its action on 
non gauge invariant states can be modified at will. Thus, by minimizing 
a particularly chosen Hamiltonian without properly restricting the set of 
allowed states, one is taking the risk of finding a "vacuum" which has nothing 
to do with the physical one, but is only picked due to a specific form of the 
action of the Hamiltonian outside the physical subspace. There are two ways 
to approach this problem. One is to solve Gauss' law operatorially a la Dirac 
[71 IH]. This leads to a "completely gauge fixed" formalism. Once this is 
done any state can be chosen. The price to be paid is that the Hamiltonian 
in any completely fixed gauge is very complicated. The calculation of the 
expectation value of energy then is not analytically manageable. Another way 
is to let the Gauss' law constraint be, but write down trial states which are 
explicitly gauge invariant. The Hamiltonian in such a calculation is simple, 
but the trial states are usually very complicated. Thus the problem of gauge 
invariance is linked very strongly with the problem of calculability. 

2 The variational setup 

So given that the only path integral that we know how to calculate analyti- 
cally is that of a Gaussian wave function, what kind of progress can one make 
with variational calculations in QFT? Surprisingly enough in QFT without 
gauge symmetry a Gaussian ansatz can take one a long way. The famous 
BCS theory of superconductivity is nothing but a variational analysis of the 
interacting QFT using a Gaussian variational stated]. The Gaussian Ansatz 
has also been applied to self-interacting relativistic scalar and spinor theories 
jlUj . where it gives non-trivial exact results in the large N limit. The reason 
it works is quite simple. A Gaussian wave functional is the exact ground 
state in quantum field theories of non-interacting fields, massless or massive. 
As a trial state it is thus flexible enough to probe the existence of a mass 
gap in the spectrum. Therefore, whenever the main effect of the interaction 
is to dynamically generate (or significantly change) the mass of the parti- 
cles, the Gaussian Ansatz is adequate and informative. Put another way, 
one can hope that the Gaussian WF is useful if the non-perturbative physics 



is dominated by a single condensate, that of the lowest possible dimension. 
From this point of view, it would seem that, it is perfectly reasonable to try 
a similar variational Ansatz in the Yang-Mills theory. After all, it is strongly 
suggested by the QCD sum rules [T^ that the pure glue sector is strongly 
dominated by the lowest dimensional non-perturbative condensate {F"^). 

The difficulty comes from the necessity to impose gauge invariance. It 
is very easy to see that in a non-abelian theory it is impossible to write a 
Gaussian WF that satisfies the constraint of gauge invariance. The SU(N) 
gauge theory is described by a Hamiltonian 



H = i d'x 

where 

Efix) ■ ^ 



\Ef + \Bf 



(4) 



M?(a;) 



Bt{x) = -e,,,{d,Al{x) - d.A'^ix) + gr'^A%x)Al{x)} , (5) 

and all physical states must satisfy the constraint of gauge invariance 

G"(x)^[A] = [diE^ix) - gr^'A'l{x)E^{x)] ^[A] = . (6) 

Under a gauge transformation U — generated by C^i^x) — the vector poten- 
tial transforms as 

A^ix) -. Af^x) = S^\x)A'i{x) + Xtix) (7) 

where 

S^\x) = -tr (r"f/V^f/) ; A^(x) = -tr [r^'U^diU) (8) 

^ 9 

and r° are traceless Hermitian N by N matrices satisfying tr(r''r^) = 25"*. 
A Gaussian wave functional 

^At] = exp { - i I d'xd'y [A^{x) - C(a:)] 

■{G-')tS{^,y)[A%y)-CM] (9) 



transforms under the gauge transformation as 

vl/[4«]^vl;p^)^]. (10) 

In the abehan case it is enough to take diG^^ = to satisfy the constraint 
of gauge invariance. In the non-abelian case, however, due to the homoge- 
neous piece in the gauge transformation eq. (|7j), no gauge invariant Gaus- 
sian WF exists. Thus one has to abandon the notion of a simple Gaussian 
variational Ansatz on the Hilbert space spanned by the canonical variables 
{Al{x),Et{x)}. 

One option is to attempt to solve the constraint eq. (jHI) operatorially. 
Such a procedure leads to a "completely gauge fixed" formalism. For ex- 
ample if one solves eq. (jH)) for E^ and then performs Dirac quantization, 
one obtains the Hamiltonian in the axial gauge. Another popular choice is 
to express diEf in terms of the transverse components of electric field and 
vector potential. This leads to the Hamiltonian in the Coulomb gauge. In 
either case, after the Hilbert space has been reduced to the physical space, 
the Gaussian trial state in the remaining degrees of freedom can be con- 
sidered. However, the actual calculations are extremely difficult since the 
Hamiltonian in any completely fixed gauge is very complicated. Attempts 
to perform such a calculation in the axial gauge were made in fUl El • The 
problem encountered here is that the calculation is plagued by a spurious 
infrared pole of the form I//C3. The results depend very much on the way 
one treats this pole, and there is no clear understanding as to the proper 
way to do so. The Coulomb gauge calculation is even less straightforward, 
primarily because the Coulomb gauge is known to suffer from Gribov am- 
biguity jT8|. Thus the expression for the Hamiltonian that one obtains in 
this gauge is only formal, and has to be very carefully defined by considering 
zeros of the Fadeev-Popov operator. Non-perturbative analytic calculations 
in this gauge are therefore all but impossible ^Hl- Attempts have been made 
to perform numerical variational minimization taking account of the Gribov 
horizon [201 E], but the interpretation of the results is very difficult. 

Another route, pursued in [22], is to modify the Gaussian trial state in 
a "minimal" way to make it compatible with gauge invariance. Thus one 
considers the state 

■^[A] = exp I- j £x£yB'^{x)W''\x,y)B]{y)G,,{x ~ y)\ (11) 



where W"''^ is the Wilson path ordered integral along the straight line con- 
necting points X and y taken in the adjoint representation. Explicit intro- 
duction of the Wilson line indeed makes the state gauge invariant. Some 
semi-quantitative arguments were given in [22] to the effect that the best 
variational state of this type will have a short range variational function 
G{x — y) and therefore will be confining at large distances. However the 
state is so complicated that no reasonable way to get the actual calculation 
going was ever found. 

Finally, another series of works attempts to initially disregard the Gauss' 
law constraint, and subsequently calculate corrections due to its implemen- 
tation perturbatively [2S1 1211 12^1 I2ni 121] • A clear discussion of this method 
and the analogy with nuclear physics calculations is given in a recent re- 
view paper [2H]- Although the method is mathematically very elegant, it is 
not suited for systems whose energy may be lowered by a non-perturbative 
amount due to the exact implementation of Gauss' law. As we believe this 
to be the case for the Yang-Mills theory, we strongly doubt the usefulness of 
this method to obtain realistic results. 

In this review we will therefore restrict ourselves to the discussion of the 
approach proposed in |T3|. The scheme devised in J3] is rather straightfor- 
ward. One opts for strictly preserving gauge invariance, and constructs a 
gauge invariant wave functional by projecting the Gaussian wave functional 
of eq. (jnj) onto the gauge invariant sector. Restricting ourselves to the case 
of zero classical fields (C = 0), the variational Ansatz proposed in ^^ is 

v^[4"] = JdU{x) exp ^-^-ld'xd'yA^^{x)GT^'''\x,y) Af\y)^ (12) 

with Af°- defined in eq. ([7|) and the integration is performed over the space 
of special unitary matrices with the SU{N) group invariant measure. 

The trial state defined in this way is explicitly gauge invariant. It is not 
obvious at this stage that we will be able to deal with it analytically. But 
as we will see in the following, some headway can be made using analytical 
approximation schemes to calculate various expectation values in this state. 
In this review we will discuss the applications of this technique to the pure 
Yang-Mills theory at zero as well as at finite temperature. 

But before plunging head first into the discussion of non-abelian Yang- 
Mills theories, we will consider the much simpler, but nevertheless informa- 
tive, case of a compact U(l) gauge theory in 2+1 dimensions [22] • The theory 



8 



is known to be confining, and our aim is to see whether the confining prop- 
erties of the ground state can be captured with our simple Ansatz. It will 
also illustrate how the use of different field theoretical techniques allows one 
to carry out the variational estimation of the ground state. 

3 Compact QED in 2+1 dimensions 

We start by setting up the Hamiltonian description of U(l) compact QED. 
First of all, we need to determine what the Hilbert space of admissible states 
is. It is clear that Gauss' law should be implemented, and thus all the physical 
states should satisfy 

exp{,/.'.a,,W^.w}|*)H*) (13) 

for an arbitrary continuous function (f){x). This is true for both compact 
and non-compact QED. There is however a crucial difference between Gauss' 
law in the compact theory and in the non-compact one. In the non-compact 
theory eq. (jT^ should be satisfied only for regular functions 0. For example, 
the operator 

nx)=exp{l|Ae,|^^.(|/)} (14) 



which has the form of eq. ()T3|l with the function (j) proportional to the planar 
angle 9, i.e. = -9{x), does not act trivially on the physical states. In fact, 
this operator creates a point-like magnetic vortex with magnetic fiux 271 /g 
and therefore changes the physical state on which it acts. 

In the compact theory the situation in this respect is quite different. 
Point-like vortices with quantized magnetic flux 2im/ g cannot be detected 
by any measurement. This translates into the requirement that the creation 
operator of a point-like vortex must be indistinguishable from the unit oper- 
ator. In other words, the operator eq. (J14p generates a transformation which 
belongs to the compact gauge group, and should therefore act trivially on all 
physical states. Eq. (fT^ should therefore be satisfled also for these operators. 

Accordingly, the Hamiltonian of the compact theory must be also in- 
variant under these transformations. The magnetic fleld itself, deflned as 
B = eijdiAj, does not commute with V{x) (cf. ref. [TT]). 

V^{x)B{y)V{x) = B{y) + —5\x - y) . (15) 

9 



The Hainiltonian should therefore contain not B^ but rather a periodic func- 
tion of B. We will choose our Hamiltonian to be 

H = la'J2El,--^^Y.^osga'B^. (16) 

Since we will need in the following an explicit ultraviolet regulator, we 
switched temporarily to lattice notations. Here a is the lattice spacing, and 
the sums are respectively over the links and plaquettes of the two-dimensional 
spatial lattice. The coefficients of the two terms in the Hamiltonian are cho- 
sen so that in the weak coupling limit, upon formal expansion to lowest order 
in g"^, the Hamiltonian reduces to the standard free Hamiltonian of 2+1 di- 
mensional electrodynamics. Following Polyakov [3T], we work in the weakly 
coupled regime. Since the coupling constant g'^ in 2+1 dimensions has di- 
mension of mass, weak coupling means that the following dimensionless ratio 
is small 

/a < 1. (17) 

Our aim now is to find variationally the vacuum wave functional of this 
theory. 

3.1 The variational Ansatz 

As our variational trial ground state we choose the Gaussian wave function 
of A projected onto the gauge singlet. The projection has to be performed 
with respect to the full compact gauge group of eq. (fT^ . 

To facilitate this, we define a vortex field A^{x) that satisfies (we suppress 
the lattice spacing a henceforth) 

V X A^{x) = —6\x) , V ■ A^ = . (18) 

9 

This is the vector potential corresponding to a magnetic field that is zero 
everywhere except at a; = 0, where it takes the value — . The explicit solution 
of eq. (UHl) is 



1 Xj 

9 x^ 



Ar{x) = ~-e.,^^. (19) 



The compact gauge invariance requires the variational wave function ip[A] to 
be invariant under shifts A -^ A + A^ . This is, of course, consistent with the 



2tt 



periodicity of H under B ^ B + 
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Hence we define a field, shifted by a non-compact gauge transformation 
^n and by an integer valued vortex distribution m{x), 



j^i^,m) ^^^ ^ ^f^^^ _ y^(^) _ J2 m{y)A^{x - y) 



(20) 



or, for short, 

A^"^'"^) = A - V0 - A^ ■ m . (21) 

We choose the gauge invariant and periodic trial wave function as 

m]= E jm^^V -\jd^x(fyAf^^G-\x~y)Af^^ . (22) 

{m(x)} 

Under a gauge transformation, 

^[A + VX]=^[A] (23) 

since A can be absorbed into a shift in 0. The simple rotational structure of 
Gij = 6ijG that appears in the variational wave function eq. (|22p is consistent 
with perturbation theory. We also take G{x) to be a real function. 

Our task now is to calculate the expectation value of the Hamiltonian in 
this state, and to minimize it with respect to the variational function G. We 
start by considering the normalization of the wave function. 

3.2 The normalization integral 

The norm of \ih) is 



{m,m'} 



X exp 



2 



exp 



2 



• (24) 



We shift A by V0' + A^ ■ m' and absorb the shift into and m, obtaining 
Z = Y, fmidA] e-iA(^-)G-UW.™)g-iAG-U _ (25) 



{m} 
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Now we combine the exponents according to 



j\{4',m)Q-lj^i<f>,m) _|_ j\Q-^j\ — 2^('/'/2.™/2)(^-l^(</'/2,m/2) 



+ -S{<P,m)G-'S{<l),m), (26) 



where 



V 



5 = V0 + A^ -m 



(27) 



It is easy to see that SG ^S contains no cross terms between m and 0. We 
shift A by V0/2 + A^ ■ m/2, and all the fields decouple. We have then 



Z — Zj^Za,Z^ , 



where 



Za = det 7tG , 

z, = jme-i-'f-^-'-''^ 

1 



[^0] e-|V0.t.-.v0 _ I ^g^4^_^ 



V2 



1/2 



exp 



Wn'} 



%2 



(i a;(i ym{x)D(x — y)m{y) 



(28) 

(29) 
(30) 

(31) 



Here, Z„ is the vortex "partition function" , with the "vortex- vortex interac- 
tion" D given by 

D{x' - y') = g^ I d^xd^yA'^ix - x') ■ G'^x - y)A^{y - y') . (32) 



We can split off the x = y terms in eq. (jHT|) and write 

1 



[dm{x)] exp 



4g2 



m{x)D{x — y)m{y) 



x^y 



n 



,m(y)^ 



(33) 



-■ y 



where we have defined the vortex fugacity 



^ = e 49 



^DiO) 



(34) 



We expect the ultraviolet behaviour of the variational function G to be 
the same as in the free theory, viz. (for the Fourier transform). 



G-\k) ^k. 



(35) 



12 



so that 

/"A j2l a 2 

m-J J^2^G-\k)^2nA (36) 

and thus 

z ~ e"^^ , (37) 

where we use the momentum space cutoff A = a~^. 

In the wealf couphng region we have 2; -C 1, which allows us to restrict 



possible values of m to 0, ±1 in eqs. ()3ip33|) . 

3.3 Expectation values 

We are now ready to calculate the expectation value of the Hamiltonian 
eq. (fTH|) . Using the definition eq. (j^ . we obtain 



V-'( I E^£x) = -^(i; 



Q2 
2^ flA2 



dA-^, 



i, 



where K{k) is the correlation function of the vorticity 

K{k)= f (fxe'''''{m{x)m{0)) . (39) 



To calculate correlation functions of m we use a duality transformation 
Pm 1221 inni E11- We add an iJ ■ m term to the exponent in eq. (j^ and use 
the formula 

to obtain 

Z.[J]= f[dx]e-^"^-''^'-^l[[l + 2cos{xix) + J{x))]. (41) 

Noting that ^ 

cos(x + J) = (cos x)o : cos(x + J): = z: cos(x + J) ■ (42) 

^The normal ordering is performed relative to the free theory defined by the quadratic 
action in eq. H41|l . 

13 



we have 

Z, = jidx] e-^'^-^'^-^ \{[l + 2z :cos(x + J) :] 

~ / -D^exp —g'^xD^^X + '^z / d^^ :cos (x(x) + J(x) j : 

Correspondingly PU] , 

{m{x)m{y)) = 2g^D-\x - y) - V {D-\{x) D-\{y)) 

and 

ir(A;) = 2^ + 0(^2), 

which, in this approximation, does not depend on momentum. 

The propagator of x is also easily calculated. To first order in z, 

1 



(43) 

(44) 
(45) 



d'xe^'^{x{x)xm 



D{k) D\k) 



29' 



•29' 



+ 0{z''). 



(46) 



Thus we have 



<''>' G-Hk) - K. 



(27r 



9 






k-'G-\k) . (47) 



The magnetic part is easily calculated since it has an exponential form 
and, therefore, with our trial wave function leads to a simple Gaussian inte- 
gral. We find 

d^k 



l^^ingB^^ = exp 



-'-2 2 



(27r) 



-yG{k) 



/ imTmn\ 



(48) 



The second factor is different from unity only for odd values of n. Using 
eq. (jlH|l we find (e*'"") = e~^^. Expanding to leading order in g'^ and z, we 
get 

--^ cos gB) = ] [ ^,k'G{k) + ^^z , (49) 



9 



(2vr) 



9 



where we have dropped an additive constant. Finally, the expression for the 
variational vacuum expectation value of the energy to first order in z is 



V^ ' 4 



d^k 



G-\k) + k'G{k) 



An' 



z k-'G-\k) - — 



TT" 



• (50) 
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3.4 Determination of the ground state 

The expression eq. fl50p has to be minimized functionally with respect to 



(51) 



G{k). 


i-rom eas. (|34l3(i|) 


we find 












6z 
6G{k) 


- ^ k- 


^^G~\ 


The variation of eq. 


m 


gives ^ 








e - G-\k) = 


47r^ 
9' 


zk~'G~ 


\^)j 


<Pp 

(2vr)2 



Eq. (|^ has the solution 



p-'G-\p) 



vr^ 



(52) 



where 



47r^ 



m 



G-\k) 



dPk 

(2^ 



A;2 + im? 



k-^G~\k) - 



vr^ 



(53) 



(54) 



The main contribution to the integral in the gap equation, eq. ()54|1 . comes 
from momenta k'^ ^ m^. For these momenta k'^G~'^{k) = 1. Therefore, we 
see that eq. (jH^ has a non-trivial solution. Using eqs. ()34|3fip5Hj) we obtain 



m 



47r^ 
-^exp 



7r 



(fp 



(Pk 



e 



^2 _|_ ^ 



which for (7^^ = g^ J K <^ 1 can be simphfied to (cf. eq (jHTj) ) 



vr^ 



m 



47r^ 



TT' 



-4)A^ 



^' 



exp 



ttA 



(55) 



(56) 



where we have restored the ultraviolet cutoff dependence explicitly. The 
resulting m is the mass gap of the theory, in the sense that it is the inverse 



^We have dropped a term —^^zk ^G ^(fc) from the right-hand side of eq. (|52|l since 

it is smaUer by a factor of ^^ than the term retained when one assumes G ^ k^^ at large 
k. 
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of the spatial correlation length. Calculating, for example, the propagator of 
magnetic field, we find 

and at large distances (neglecting power-like prefactors), 

V^Gix) = - [ -^{k^ + m2)V2e^k.x _ ^-m. ^gg) 

This dynamically generated mass is Polyakov's result ^JE\ [SU]. Thus, we 
recover in the Hamiltonian approach the first important result known about 
compact QED — a finite mass gap m, as well as its correct dependence on 
the coupling constant. 

3.5 Spatial Wilson loops 

We also want to see whether the charges are confined in our best variational 
state. The simplest quantity that is related to confinement is the expectation 
value of the Wilson loop. Therefore, we will calculate it in our ground state 

Wi[C] = /exp (ilg / A ■ t/x j \ = /exp (ilg f B dSj\ , (59) 

where / is an integer and the integral is over the area S bounded by the loop 
C. We have 



(60) 



Wi[C] = /jje^'™"\z^^ f DAexp(~AG-^A + ilg I B dS 

The second factor is a Gaussian integral, which gives 

Wa = exp r^ j d'x j d'y V'Gi^ - y)\ . (61) 

In the limit of large S the leading behaviour of the exponent is 

--g^i: lim eG{k) = --g^m^ . (62) 



16 



This gives the area law with the string tension 



a 



-g^m . 



(63) 



The first factor in eq. ()60|) is different from unity only for odd /. It can be 
easily calculated but gives only subleading corrections to the string tension 



3.6 Potential between external charges 

We have thus found that in the best variational state the Wilson loop has an 
area law behaviour. This usually signals confinement, and one is naturally 
inclined to conclude that we have indeed found confinement with the string 
tension related in the expected way to the dynamically generated scale, a oc 
g^m. One must be however a tad more careful at this stage, since the spatial 
Wilson loop does not directly give the potential between external charges. 
Although in the Euclidean formulation there is no difference between spatial 
and time-like Wilson loops, in the Hamiltonian approach this is not obvious. 
It is, therefore, desirable to calculate directly the potential between external 
charges. 

How does one do this? Obviously one has to introduce into the theory 
the source corresponding to the pair of external charges. The result is a 
modification of Gauss' law, 



d,E, 



X] 



gp{x) 



(64) 



with p{x) = S{x — Xi) — S{x — X2). As the external charges do not have 
dynamics of their own, the Hamiltonian remains unchanged. The seemingly 
simplest option seems to be to take the same Gaussian variational state which 
minimizes the energy in the vacuum sector, and project it with the modified 
projection operator corresponding to the new Gauss' law. This "minimally 
modified" state would be 



\m{x)} 



+ Wp{x)^{x) 



■ (65) 
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One could then calculate the energy expectation value in this state and take 
this as an estimate of the interaction potential. This procedure was suggested 
in [^ EHl EH]- However it turns out that the estimate obtained for the 
interaction energy in this way is very unreliable. For example, in the compact 
QED3 case this calculation was performed in ^] with the resulting energy 
being not even infrared finite. Instead, to get a reasonable estimate for the 
energy one has to introduce additional variational parameters. In f^D^ the 
variational Ansatz was extended in the following way: 



^\A] = J2 /[^<^] ^^P -II d'xd'yAi*'"^^G-\x - y)Alf' 



{Tn{x)} 



m) 



+ iei{x)Ay'"^' + igp{x)(t){x) 



(66) 



Here, the "classical" electric field profile ei{x) is to be varied so as to mini- 
mize the energy. It turns out that with this extra variational parameter the 
calculation gives very satisfactory results jHH. Without giving the details of 
this calculation, we note that it confirms the expectation that the potential 
between external charges is a linear function of the separation. The string 
tension calculated this way coincides within ten percent with eq. (|UHj). 

What have we learned from this toy model? First of all it is very encour- 
aging that the Gaussian projected Ansatz is good enough to reproduce all 
known results and, in particular, confinement and dynamical mass generation 
with parameterically correct values of string tension and mass. 

We have also seen that we need some ingenuity to be able to carry out 
the calculations with the variational wave function. We were able to do it 
in this simple case as the coupling constant was small and we could utilize 
existing methods for treating weakly interacting two dimensional systems. It 
is quite clear that in 3+1 dimensions no such techniques will be available and 
consequently the situation will be, in this respect, much more complicated. 
Finally, we learned that we could not impose the Gauss' law constraint on the 
state perturbatively. Perturbatively, the operator V — eq. (fT^ — simply 
vanishes. However, without including it into the projection procedure of 
the Gaussian Ansatz, we would not get any non-trivial results — we would 
have found the vacuum of a free massless photon without a dynamical mass 
gap and with vanishing string tension. We fully expect that this aspect will 
remain important in the application to QCD, to which we turn in the next 
section. 
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4 The Yang- Mills theory 

The dynamics of the pure glue sector of QCD is described by the Yang-Mills 
(also often called gluodynamics) Hamiltonian 



H = I d'x 

where 

Ef(x) 



^Ef + ^Bf 



(67) 



dA-ix) 

1 

— ( 
2 



Bt{x) = -e,,, {d,Al{x) - d.A'^^ix) + gr"^A]{x)Al{x)} (68) 



and all physical states must satisfy the constraint of gauge invariance 

C^ixMA] = [d^Efix) - gr'"^A^{x)E^{x)] ^A] = . (69) 

We thus have to choose a set of variational states which are invariant 
under the action of eq. (jBHI)- We start then with a Gaussian state 

^i^o[A^] = exp!^-^ld'xd'yA1{x){G-')^^{x,y)A%y)^ , (70) 

where the set of functions G^^f^{x) are variational parameters. 

We hope that the freedom allowed by variation of G~^ is sufficiently 
wide to probe the non-perturbative physics of the Yang-Mills vacuum. As 
discussed in the introduction, the states of the form eq. (fTOIl are not gauge 
invariant and, therefore, as such do not belong to the physical Hilbert space 
of gluodynamics. To remedy this problem we project the states onto the 
gauge invariant subspace by gauge transforming them and integrating over 
the whole gauge group '^: 

nA^] = j DU{x)exY> \-\j^ A^\x)GT''^\x,y) Af{y)^ (71) 

with A^°' defined as 

Af'^(x) = S'^\x)A\{x) + X^x) , (72) 



■^Hereafter we use the notational shorthand J = J (P 



X 
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and 

S''\x) = hi [t'^U^t^U) ■ \1{x) = -tr (r^U^d^U) (73) 

^ 9 

with r — traceless N x N hermitian matrices — normalized by 

tr(rV^) = 25"^^ , (74) 

The integration in eq. (ffT|l is performed over the space of special unitary 
matrices with the SU{N) group invariant measure. This integration projects 
the original Gaussian state onto a colour singlet. Due to the projection 
operation, the calculation of expectation values in this state is much more 
involved than in the case of a simple Gaussian. A full functional minimization 
with respect to the variational functions G^-^"*(a;, y) is, therefore, beyond our 
calculational abilities. 

In order to render calculations possible, we will impose several restric- 
tions on the form of G~^, which will lead to considerable simplifications. 
First, we require the state to be translationally invariant, that is we assume 
that Lorentz symmetry is not spontaneously broken in the ground state, 
restricting the form of G"^ 

G-\x,y)=G-\x-y). (75) 

Further, we will only consider matrices G of the form 

Gt^ix-y)=6'^%,Gix-y). (76) 

This form is certainly the right one in the perturbative regime. In the leading 
order in perturbation theory, the non-abelian character of the gauge group is 
not important, and the integration in eq. ()7ip is basically over the [/(l)^ ~^ 
group. The 6""^ structure is then obvious — there is a complete democracy 
between different components of the vector potential. The 6ij structure arises 
in the following way. If not for the integration over the group, G^-^ would 
be precisely the (equal time) propagator of the electric field. However, due 
to the integration over the group, the actual propagator is the transverse 
part of G~^. It is easy to check that the longitudinal part diG~^ drops out 
of all physical quantities. At the perturbative level, therefore, one can take 
Gij ~ 5ij without any loss of generality. Outside the perturbative framework 
eq. (fTUj) is a genuine restriction on the Ansatz, and we will adopt this form 
of the matrix G in order to simplify our variational calculation. 
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We can use additional perturbative information to further restrict the 
form of G. The theory of interest is asymptotically free. This means that 
the short distance asymptotics of correlation functions must be the same as 
in perturbation theory. Since G~^ in perturbation theory is directly related 
to correlation functions of gauge invariant quantities (e.g. E"^), we must have 

G-\x)^-^, x^O. (77) 

x^ 

Finally, we expect the theory non-perturbatively to have a gap. In other 

words, the correlation functions should decay to zero at some distance scale 

G(x)~0, x>^. (78) 

We will build this into our variational Ansatz in a fairly naive way. We 
will take M to be our only variational parameter. This can be done by 
choosing for G{x) a particular form that has the UV and IR asymptotics 
given by eqs. (f77j) and (jTSj). like, for example a massive scalar propagator 
with mass M . We find another parameterization slightly more convenient. 
The form that will be used throughout this calculation has the following 
Fourier transform: 

^ (^) = 1m iik^<M^ ■ ^^^) 

Using a massive propagator instead, changes the results very little. Eq. (ffTjl . 
together with eqs. ()7fip79j) . defines our variational Ansatz. 

We now have to calculate the energy expectation value in these states 
and minimize it with respect to the only variational parameter left — the 
scale M. Note that the perturbative vacuum is included in this set of states 
and corresponds to M = 0. A non-zero result for M would therefore mean 
a non-perturbative dynamical scale generation in the Yang - Mills vacuum. 
Now we have to face up to the question of how to calculate averages in the 
state eq. (f7T|). 
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4.1 The effective cr- model 

The expectation value of an arbitrary gauge invariant operator O is given by 
the functional integral 

{0) = ^JdUDU'{0)a, 

where Z is the norm of the trial state. Two simplifications are immediately 
obvious. First, for gauge invariant operators O, one of the group integrations 
is redundant. Performing the change of variables A -^ A^ (and remembering 
that both integration measures DU and DA are group invariant), we obtain 
(omitting the volume of SU{N) factor J dU) 

{0) = ^JdU{0)a. 



{0)a= I DAe-5/-^^»''"(")^"'("-^)^»''"(^) 



Also, since the gauge transform — eq. ()72|l — of a vector potential is a linear 
function of A, for fixed U{x) this is a Gaussian integral, and can therefore 
be performed explicitly for any reasonable operator O. We are then left only 
with a path integral over one group variable U{x). But this is not easy. 

Let us consider first the normalization factor Z. After integrating over 
the vector potential we obtain 

Z= I DU exp{-T[U]} (82) 

with 

r[U] = -TT\nM + -X[G + SGS^]-^X, (83) 

where multiplication is understood as the matrix multiplication with indices: 
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colour a, space i and position (the values of space coordinates) x, i.e. 



y 



XOX= / X1{x)0Z{x-y)X'^{y). (84) 

Jx,y 

The trace Tr is understood as a trace over all three types of indices. In 
eq. (jSH|) we have defined 

S^^ix,y) = S'^\x)6i,6ix-y), 
Mt^ix^y) = [S^^'=ix)S'^\y) + 5^']G-\x - y)6,, (85) 

where S^^x) = ^tr (t^Wt^U) and X^{x) = hi (r^WdiU) were defined in 
eq. (f73j) and tr is a trace over colour indices only. Using the completeness 
condition for SU{N) 

one can see that S""^ is an orthogonal matrix 

S^'S^' = Ky,, (Ur-U^)^^ iUr''U^)ki = ^^r (rV^) = S'^' . (87) 

We have written eq. ()83p in a form which suggests a convenient way of 
thinking about the problem. The functional integral eq. (J82|l defines a parti- 
tion function of a non-linear sigma model with the target space SU{N)/Zn 
in three dimensional Euclidean space. The fact that the target space is 
SU{N)/Z^ rather than SU{N) follows from the observation that the action 
eq. (jHn|) is invariant under local transformations belonging to the centre of 
SU{N). This can be trivially traced back to invariance of A'} under gauge 
transformations that belong to the centre of the gauge group. 

Note that the quantity U{x) has a well defined gauge invariant meaning, 
and it is not itself a matrix of a gauge transformation. A contribution of 
a given U{x) to the partition function eq. (|H^ and to other expectation 
values corresponds to the contribution to the same quantity from the off- 
diagonal matrix element between the initial Gaussian and the Gaussian gauge 
rotated by U{x). Consequently, if matrices U{x) which are far from unity 
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give significant contribution to the partition function, it means tliat tlie off- 
diagonal contribution is large, and therefore that the simpleminded non gauge 
invariant Gaussian approximation (which neglects the off diagonal elements) 
misses important physics. 

The action of this sigma model is rather complicated. It is a non-local and 
a non-polynomial functional of U{x). There are, however, two observations 
that will help us devise an approximation scheme to deal with the problem. 
First, remembering that the bare coupling constant of the Yang-Mills theory 
is small, let us see how it enters the sigma model action. It is easy to observe 
that the only place it enters is in the second term in the action eq. ()83|). 
because A°(x) has an explicit factor 1/g. Moreover, it enters in the same way 
as a coupling constant in a standard sigma model action. We can, therefore, 
easily set up perturbation theory in our sigma model. With the standard 
parameterization 

[/(x) =exp|z|0V"} (88) 

one gets A^(a;) = —di(j)"'{x) + 0{g), S"'''(a;) = (5"^ + 0(5') and the leading order 
term in the action becomes 

4 [ d,rix)G-\x-y)d,riy)- (89) 

This is a free theory albeit with a non-standard propagator which at large 
momenta behaves as 

Z}(fc)~G(fc)l~^. (90) 

It is easy to see that in this sigma model perturbation theory the cou- 
pling constant renormalizes logarithmically. The first order diagram that 
contributes to the coupling constant renormalization is the tadpole. In a 
sigma model with a standard kinetic term this diagram diverges linearly as 
J d^k/k'^, a sign of perturbative non-renormalizability. In our model, how- 
ever, due to a non-standard form of the kinetic term eq. (jM?]) . the diagram 
diverges only logarithmically as J d^k/k^. The form of the /3-function is, 
therefore, very similar to the /5-function in ordinary QCD perturbation the- 
ory. It is a straightforward albeit tedious matter to calculate the one loop 
graphs in the cr-model perturbation theory and to extract the renormalization 
of the coupling constant jH] . The result is 

f^i9) = -j£y,^N . (91) 
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For comparison, the pure Yang-Mills /3-function is 

The two almost coincide. The first contribution to the Yang- Mills /5-function 
which is reproduced by eq. (j^ is due to the longitudinal gluons, or in other 
words to the implementation of Gauss' law. Since we have implemented 
Gauss' law exactly on our trial wave function, this (major) part of the (3- 
function is correctly reproduced by the cr-model renormalization group. The 
second contribution, which is not present in eq. (HD), is due to the dynamics 
of transverse gluons. The fact that this contribution is missing in the cr- 
model suggests that our Ansatz is not perfect in the ultraviolet. However, as 
this contribution is relatively small, we will not be discouraged at this stage. 
We will simply think of eq. (|91|) as representing the complete one loop Yang- 
Mills /5-function, keeping in mind that it would indeed be very interesting to 
eliminate this discrepancy by perhaps exploring a less simplistic form of the 
variational propagator G(/c) [HTj . 

Since the cr-model is asymptotically free, perturbation theory becomes 
worse and worse as we go to lower momenta, and at some point becomes 
inapplicable. 

Now, however, let us look at the other side of the coin: let us see how the 
action looks for the matrices U{x) which are slowly varying in space. Due to 
the short range of G{x), for U{x) which contain only momenta lower than 
the variational scale M the action is local. In fact, with our Ansatz eq. (|79p. 
it becomes the standard local action of the non-linear cr-model 

^l[U] = ^tr f d,U\x)d,U{x) + . . . (93) 

In this low-momentum approximation we also neglected the space depen- 
dence of Sfj{x) in the term SGS^ in eq. (|HHj): then, using the fact that S is 
an orthogonal matrix eq. (jEZj), one gets SGS'^ -^ G. 

Strictly speaking, due to the Z^ local symmetry of eq. (jHSl), the action 
for the low momentum modes is slightly different. The derivatives should be 
understood as Z^ covariant derivatives. The most convenient way to write 
this action would be to understand U{x) as belonging to U{N) rather than 
SU{N) and introduce a f/(l) gauge field by 

Tl = ^^tr [{d, - thi)U\x){d, + ih)U{x) . (94) 

2r Jx 
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This defines a sigma model on the target space U{N)/U{1), which is isomor- 
phic to SU{N)/Z]\r. This subtlety is unimportant for large A^ and will not 
play a significant role in our analysis. 

The action eq. (j^ does not look too bad. Even though it still cannot 
be solved exactly, it is amenable to analysis by standard methods, such as 
the mean field approximation, which in 3 dimensions and for large number 
of fields should give reasonably reliable results. 

We adopt therefore the following strategy for dealing with the integra- 
tion over the SU{N) group. We integrate perturbatively the high momentum 
modes of the field U{x). This is the renormalization group (RG) transfor- 
mation. We would like to integrate out all modes with momenta fc^ > M^. 
This procedure will necessarily generate a local effective action for the low 
momentum modes. At the same time, because of the (presumed) equivalence 
of the RG flows in QCD and our effective sigma-model, the effective coupling 
constant will be the the running QCD coupling constant aqcniM) at scale 
M. This part of the theory can then be solved in the mean field approxima- 
tion. Clearly, in order for the perturbative RG transformation to be justified, 
the QCD running coupling constant at the scale M must be small enough. 
Our procedure makes sense provided the energy is minimized at a value of 
the variational parameter M for which 

«qcd(M)<1. (95) 

We will check whether this consistency condition is satisfied at the end of the 
calculation. In the next section we will calculate the expectation value of the 
Hamiltonian in the lowest order of this approximation scheme, and perform 
the minimization with respect to M. 

4.2 Solving the variational equations — dynamical mass 
generation 

We will now calculate the energy, i.e. the expectation value of the Hamil- 
tonian eq. (j67|) . We first perform the Gaussian integrals over the vector 
potential at fixed U{x). Let us consider, for example, the calculation of the 
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chromoelectric 


energy: 
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6At{x) 


SAtix)/ 
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-f 


G-Hx 


' 



G-\x - y)G-\x - z){At{y)At{z))A ■ (96) 

J x,y,z 

The Gaussian averaging over A is easily performed. Defining for convenience 

at{x)= I \\{y)G-\y-z)S'\z){M~'r{z,x) (97) 

one gets 

f{Ef)A = 3iN^-l) I G-\x,x)- f{G-^M-^G-')l^{x,x) 

Jx Jx Jx 

at{x)G-'{x-y)at{y). (98) 



'x,y 

For the chromomagnetic contribution the calculation is straightforward and 
one gets 



{ie,,,d,Aiy)A = ie,,,d,aty + e,,,eumd:dfiM-rk:nix,y)\x=y, (99) 



a Ab AC \ J5 „a „b „c , o „a/ \ /i — l\bc 



{d,AlA',Ai:)A = d.alay^ + d,at{M 



)lm.\-^-i 



X,X] 



+ a\&^{M-riL{x,y)l=y + cil,d^^M-')t{x,y)l=y, (100) 
and 

fabc fade / Ab AC /\d AC \ _ o f a^c /ade fe c d e 
tijktilmj J \^j^k^l^m/A — ^J J (^j(^k^l<^m 

+ sr^T'^^aYiiM-yix, x) 

+ ur^'T'^^iM-^f^x.x) .{M~^y\x,x) (101) 

Here, we have used the obvious notation Mf^ = Ai"'''6ij. The next step is 
to decompose the matrix field U{x) into low and high momentum modes. In 
general this is a non-trivial problem. However, since we are only going to 
integrate over the high momenta in the lowest order in perturbation theory, 
for the purposes of our calculation we can write 

U{x) = Ul{x)Uh{x) (102) 
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where Ul contains only modes with momenta k^ < M^, and Uh has the 
form Uh = 1 + igT"'(f)'^ and (pn contains only momenta k'^ > M^. This 
decomposition is convenient, since it preserves the group structure. Also, 
since the measure DU is group invariant, we can write it as DUlDUh- With 
this decomposition we have: 

X^x) = S'k\x)Xl{x) + X'^ni^) . (103) 

Further simplifications arise, since we only have to keep the leading piece in 
(plj. In this approximation: 

S^\x) = Sl\x) , 
M'"'{x,y) = 26'''G-\x-y), 
X%x) = XUx) + X^x) , 

atix) = \xt^{x) + \x\h{x)SI\x) . (104) 

The chromoelectric part of the energy can then be written 



L 



{Ef)A= ^^^l ^^ Ig-\x,x)-- I XUx)G~'{x-y)Xtdy) 

-\l Kh{^)G-\x - y)Xtn{y) ■ (105) 



The cross term vanishes since to this order, as we shall see, there is a de- 
coupling between the high and the low momentum modes in the action, and 
therefore the product factorizes, and also {X^f^) = 0. The Ansatz eq. (f7^ 
allows us to simplify this expression further. Recalling that Xl{x) contains 
only momenta below M, it is immediate to see that 



X^^G-^x - y)XUy) = M' / XUx)XUx) • (106) 

x,y J X 

We can then rewrite eq. (jl05|) as 

— / Kl{^)Kl{^) - 4 / Kh{^)G-\x - y)XtM . (107) 



h 
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The contribution of the magnetic term to the energy is very simple. All cross 
terms between the low and high momentum modes drop out. Some vanish for 
the same reason as the cross terms in eq. ()1U5|) . and others because they are 
explicitly multiplied by a power of the coupling constant. Since our approx- 
imation is the lowest order in g, except for the non-analytic contributions 
that come from the low mode effective action, those terms do not contribute. 
In fact, the entire low momentum mode contribution drops out of this term. 
The reason for this is that the only term which could give a leading order 
contribution, i.e. 



'■■> 



M^kd.XlL? (108) 



can be rewritten as 



(/5l)' + 0(/), (109) 

where ff^i^ is the "magnetic field" corresponding to the "vector potential" 
X^j^. However, A^ has the form of a pure gauge vector potential. Therefore 
fijL ~ 0; and the contribution of this term is higher order in g'^. One can 
check a posteriori that including this term indeed changes the energy density 
in the best variational state by a small amount (0(10%)), but has no effect 
at all on the best value of the variational parameter M. The entire magnetic 
field contribution to the energy is then 

1 1 A^^ — 1 
^{Bt )a = -^{(^ijkdjXlH) H ^ 

The last step is to perform an averaging over the [/-field. For convenience, 
we rewrite the complete expression for the energy density (here V = J d?x 
is a space volume) 

-^l {>^h{^)G'\x - y)KH{y))u + \{{e.,ud,\lHf)u 



-{Bf)A = ^{e^^AKn? + ^—d^d\G{x - y)l=y . (110) 



{KLi^)KLi^))u (111) 



_M2 

where the averaging over the [/-field should be performed with the sigma 
model action eq. (jHH|) . In our approximation this action has a simple form. 
Using eq. ()104|) we obtain 

r = T / KHi^)G-\x - y)xuy) + -J / Kl{^)Kl{^) ■ (112) 

^ J x,v ^ Jx 
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The low inomentum mode part is precisely equal to T^ in eq. (|94p . The only 
difference is that the coupling constant that appears in this action should be 
understood as the running coupling constant at the scale M. This obviously 
is the only 0(1) effect of the high momentum modes on the low momentum 
effective action 

^ ^ ii [{d,-ih)U\x){di + ih)U{x). (113) 



2g\M) 

We are now in a position to evaluate the VEV of energy. The contri- 
bution of the high momentum modes is immediately calculable. Using the 
parameterization Uh{x) = 1 — ^g(j)°'T"', we find that (p"" are free fields with 
the propagator 

{r{x)ny)) = 26'^'[d!dyG-\x-y)]-' (114) 

Also to this order Xini^) = (9j0"(x) and therefore eijkdjXIfj = 0. Using 
eq. ()114|1 one finds 



- £ {X^MG-\x - y)XtM)u = ^^— J^^ J^sG-\k) (115) 



where A is the ultraviolet cutoff, and the contribution of the high momentum 
modes to the energy (first two lines in eq. pill) ) is 



^ - (^^ - ^) {fl^ [^~^(^) + ^^^(^)] + I f |^^"^(^) 



^^-^^^\^.. 



27r2 , ,0 



3„. k^ 

-M+ — 

2 M 



A 

+ 2 / k^dk 

'm 



= ^^M' + .... (116) 

Terms omitted in eq. pi(j|l depend on A, but are independent of the varia- 
tional scale M. 

We now have to evaluate the contribution of the low momentum modes. 
It is clear from the form of the action eq. ()113|) that this contribution, as a 
function of M, will not be featureless. The most convenient way to think 
about it is from the point of view of classical statistical mechanics. Compar- 
ing eqs. (jlll|) and (J113j) . we see that we have to evaluate the internal energy 
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of the sigma model (with the UV cutoff M) at a temperature proportional 
to the running coupling constant g'^{M). For large M, the coupling con- 
stant is small, which corresponds to the low temperature regime of the sigma 
model. In this regime the global SU{N) ® SU{N) symmetry group of the 
model is spontaneously broken. Lowering M, we raise g'^{M), and therefore 
the temperature. At some critical value gc, the a-model undergoes a phase 
transition into the unbroken (disordered) phase. Clearly, in the vicinity of 
the phase transition all thermodynamical quantities will vary rapidly, and 
therefore this is a potentially interesting region of coupling constants. 

Before analysing the phase transition region let us calculate E{M) for 
large M. In this regime the low momentum theory is weekly coupled. The 
calculation is straightforward, and to lowest order in g^ gives 

1 AT^ — 1 

-M'{Xt,{x)XtL{^)) = --^M^ . (117) 

4 IZTT^ 

Putting this together with the high momentum contribution, we find 

-M^ M>Aqcd. (118) 



EiM) N' - 1 ^ . 



V 1207r2 

This indeed is the expected result. The energy density monotonically in- 
creases as M'^, with a slope which is given by the standard perturbative 
expression. Note, however, that the slope is very small, and the contribution 
of the low momentum modes to the energy is negative. Therefore, if the in- 
ternal energy of the sigma model grows significantly in the phase transition 
region, the sign of E{M) could be reversed"^ and the energy will then be 
minimized for M in this region. 

To see, whether this indeed happens, we will now study the low momen- 
tum sigma model in the mean field approximation. We rewrite the partition 
function by introducing a (hermitian matrix) auxiliary field a which imposes 
a unitarity constraint on U{x) 

Z = f DUDaDbiexp{-T[U,b,a]) , (119) 



^The energy, of course, never becomes negative, since eq. (I116|l contains a divergent 
M-independent piece. Here we concentrate only on the M-dependencc of E. 
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with 

nU, b, a] = ^^^tr ^ [id, - lb,) f/t(x) (d, + tb,) f/(x) 

+ a {U^U-l)] . (120) 

The role of the vector field bi is to impose a U{1) gauge invariance and, 
thereby, to eliminate one degree of freedom. As far as the thermodynamical 
properties are concerned, its effect is only felt as an 0(l/iV^) correction. At 
the level of accuracy of the mean field approximation, we can safely disregard 
it, which we do in the following. The mean field equations are: 

(f/tf/) = l, (121) 



{aU)=0. (122) 

From eq. (jl22j) it follows that either (a) = 0, (U) ^ (the ordered, broken 
symmetry phase with massless Goldstone bosons), or {a) ^ 0, {U) = 
(the disordered, unbroken phase with massive excitations). We are mostly 
interested in the disordered phase, since there the mean field approximation 
should be reliable. Since the symmetry is unbroken, the expectation value of 
a should be proportional to a unit matrix 

{aap) = a'la/3 . (123) 

Eq. (|12H) then becomes 



2A^ 

M Jq (27r)^ k'^ + a^ 

1 - — arctan — ) = A^ . (124) 

The gap equation, eq. ()124p . has solution only for couplings (temperatures) 
g'^{M) larger than the critical coupling (temperature) g^, which is determined 
by the condition that a = 

q% IT 1 , . 

ac = — = . 125 
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The low momentum mode contribution to the ground state energy is 



N'^M 



M 



d^k k^ 



N^ 



-M 



3 



2,. ^ M' 

a M + a arctan — 
a 



(126) 



,0 (27r)3 fc2 + a2 27r2 
The final mean field expression for the ground state energy density is ^ 



E 



4^ 



M' 



2 a^ ac 
"l5 ^ M^a{M) 



(127) 



where a{M) is the QCD couphng at the scale M, ac is given by eq. ()125|1 . 
and a is determined by 



a M 

— arctan — 
M a 



a{M) - ac 
a{M) 



(128) 



The energy as a function of M is plotted on Fig. Qfor A^ = 3. Quali- 
tatively it is the same for any N. The minimum of the energy is obviously 
at the point a{M) = ac- Using the one- loop Yang-Mills /3 function and 
Aqc-^ = 150 Mev, we find for A^ = 3 



M = Aqcd e" = 8.86 Aqcd = 1-33 Gev . 



(129) 



We thus find that the best variational state is non-perturbative and is 
characterized by a dynamically generated mass scale. To get a rough idea 
whether the value of this mass is reasonable we have calculated the value 
of the gluon condensate {a/T^){F^^Fp = {2a /n) {{Bf) - {Ef)). These 
calculations are straightforward, and yield ^ 



{EV) 



247r2 



N^M^ 



{BV) 



407r2 



N^M\ 



and finally 



^ \ flU IJ.U/ 



N 



-M^ = 0.008 Gev^ . 



(130) 



(131) 



1207r2" 

^ We do not distinguish between N"^ and N"^ — 1 since we have neglected the contribution 
of the f/(l) gauge field. The errors are of order l/N'^ and even for N = 2 are very likely 
smaller than the error introduced by using the mean field approximation in the first place. 

^We have again kept only the M-dependent pieces. Each one of the quantities {Ef"^) 
and {Bf"^) is of course positive, due to positive UV divergent, but M independent, pieces. 
It is easy to check that the energy density E = l/2{{Ef'^) + (Bf 2)) :^ -{\/2>{)tt'^)N'^M'^ 
coincides with the first term in eq. H127|) . as it must. 
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Figure 1: Energy density of a variational trial state as a function of the 
variational parameter M in units of Aqcd- The energy is only shown for 
M < 8.86Aqcd, which corresponds to the disordered phase of the effective 
low momentum a-model. 



The preferred phenomenological value of this condensate is 0.012 Gev"^ 
|15j . although the uncertainty in this number is large. Considering this, 
and the unsophisticated nature of our calculation, the agreement is quite 
reasonable. 

A natural question is of course whether one can assign to M directly 
the meaning of some physical mass? In the initial Gaussian wave function, 
before the projection, it appears as the "gluon mass". However the projec- 
tion changes the wave functional very considerably, and the direct meaning 
of M (apart from it being a dimensional variational parameter) is not so 
clear. Nevertheless, naively one expects that, as the gauge invariant opera- 
tors E"^ — B^ and EB to leading order in g"^ are quadratic in the gluon field, 
the scale 1/2M should appear as the correlation length in these correlators. 
Thus we are tempted to identify 2M with the glueball mass. An attempt 
to calculate glueball correlation functions was made in t42 . The result is 
somewhat unexpected. It was found in |12] that the scale 2M does indeed 
dominate the long distance behaviour of the correlation function of the pseu- 
doscalar glueball. However it appears that the scalar glueball correlator at 
long distances is dominated by the scale a — the gap of the effective a- 
model. If correct, this would mean that the phenomenologically acceptable 
scalar glueball mass can only arise in our calculation if the phase transition 
in the effective a-model is of the first order with significant latent heat (see 
next subsection). This point clearly warrants further investigation. 
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4.3 Is it safe? 

For A^ = 3, the value of the QCD couphng constant at the variational scale 
is ac = 0.26. It is reasonably small, so that the consistency condition for the 
perturbative integration of the high momentum modes is satisfied. However, 
it is not so small that higher order corrections would be negligible. We 
expect therefore that including higher orders in perturbation theory could 
give corrections to our result for a(M) of order 25%. Since M depends 
exponentially on a(M), such change in a may change the value of M by 
a factor of 2 — 3. Consequently our result for the dynamical scale M and 
other dimensional quantities should be taken only as an order of magnitude 
estimate. In particular, the value of the condensate (F^) is proportional 
to the fourth power of M, and would change dramatically as a result of a 
moderate change of Uc- This is the normal state of affairs in theories with 
a logarithmically running coupling constant. The best accuracy is always 
achieved for dimensionless quantities, since those usually are slowly varying 
functions of a. The overall scale depends on a exponentially, and therefore 
always has the largest error. 

The use of the mean field approximation to analyze the cr-model intro- 
duces uncertainties into the result. As a rule, the mean field approximation 
gives a good estimate of the critical temperature. Sometimes, however, it 
gives wrong predictions for the order of the phase transition. We believe 
that this is indeed the case here. The mean field approximation indicates 
that the phase transition is second order. The mass gap in the sigma model 
vanishes continuously at the critical point. The universality class describing 
the symmetry breaking pattern {SU{N) ® SU{N)) /SU{N) was considered 
in the context of finite temperature chiral phase transition in QCD. The 
results of e expansion jl^l and also numerical simulations |33 |33] strongly 
suggest that the phase transition is of first order except for N = 2. In our 
case there is an additional Z^ symmetry in the game. However its presence 
is likely only to increase the latent heat rather than turn the transition into 
a second order one. The reason is that the Z^r gauge invariant theory allows 
for the existence of topological defects — the Z^ strings — and condensation 
of topological defects frequently leads to discontinuous phase transitions. 

Nevertheless, we believe that the bulk of our results is robust against 
this uncertainty. The mean field approximation should be reliable in the 
regime where the mass gap in the sigma model is not too small. At the point 
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M = 4.5Aqcd we find 

a = 0.23M, a{M) = 0.38 . (132) 

Since the gap of tlie a-model is at tliis point of tlie order of the UV cutoff, the 
mean field approximation should be reliable here. Perturbation theory is also 
still reasonable at this value of a. The fact that the energy is negative and 
has a minimum for some a{M) < 0.38, seems to be, therefore, unambiguous. 

In fact, independently of the mean field calculation, it is physically very 
plausible that the energy is minimized precisely at the critical temperature, 
on the disordered side of the phase transition (if it is of the first order). Con- 
sider first, the contribution of the high momentum modes to the ground state 
energy, eq. (|116|) . It is proportional to M^ with a fixed (M-independent) pro- 
portionality coefficient x = (A^^ — l)/107r^. Consider now the low momentum 
contribution in the large M region, eq. pi7|) . It is again proportional to M^ 
with the coefficient yo = N'^/12tt'^. The proportionality coefficient of the low 
momentum contribution at the phase transition point, according to our cal- 
culation, is twice as big yc = 2A^^/127r^. This is physically quite transparent. 
In the large M - low temperature regime the global SU{N)^SU{N) symme- 
try of the sigma model is broken down spontaneously to SU{N). This leads 
to the appearance of A^^ — 1 massless Goldstone bosons. At zero tempera- 
ture, those are the only propagating degrees of freedom in the model. All the 
rest have masses of the order of UV cutoff, and therefore do not contribute 
to the internal energy. When the temperature is raised (M is lowered), the 
Goldstone bosons remain massless and other excitations become lighter. If 
the transition is second order, at the phase transition point the symmetry is 
restored, and one should have a complete multiplet of the SU{N) ® SU{N) 
symmetry of massless particles. The dimensionality of this multiplet is 2A^^. 
The contribution of every degree of freedom to the internal energy is still 
roughly the same as at zero temperature. This is so, since, although at the 
phase transition the particles are interacting, critical exponents of scalar the- 
ories in 3 dimensions are generally very close to their values in a free theory 
|4tij . The internal energy of the cr-model at this point therefore should be 
roughly twice its value at zero temperature. Moving now to higher tem- 
peratures, all the particles become heavier, and therefore their contribution 
to internal energy decreases. The internal energy therefore should have a 
maximum at the phase transition temperature. 

Note that the ground state energy of the Yang-Mills theory is the dif- 
ference between the high momentum contributions and the internal energy 
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of the low mode sigina model. Already at zero temperature, these two con- 
tributions differ only by 20%, which is why the coefficient in the expression 
eq. (jllSp . even though positive, is so small. At the critical point, where the 
low momentum mode internal energy is twice as large, the chances of the 
slope becoming negative are very good. This is what happens in our mean 
field analysis but, according to the previous argument, this in large measure 
is independent of the approximation. If the phase transition is first order 
one should be more careful. The internal energy then changes discontinu- 
ously across the phase transition. The particles in the disordered phase are 
always massive, and the internal energy is smaller than in the case of the 
second order phase transition. However, if the transition is only weakly first 
order the same argument still holds (the fact that the mean field predicts 
second order phase transition may be an indication that if it is in fact first 
order, it is only weakly so). It does seem quite likely that the Yang-Mills 
ground state energy will become negative, since it only needs the internal 
energy of the a-model to grow by 20% at the phase transition relative to its 
zero temperature limit. In this case there will be a finite latent heat, which 
means that the internal energy in the disordered (high temperature) phase is 
higher. The Yang-Mills variational ground state energy, therefore, will have 
its minimum in the disordered phase. 

There is good reason to believe, therefore, that these results are qualita- 
tively correct, and will survive the improvement of the approximation. 

4.4 Inst ant ons 

Instantons are believed to play a very important role in the non-perturbative 
dynamics of QCD ^3]- It is, therefore, interesting to see if our variational 
calculation has any relation to the instanton approach. 

The first thing is to understand how we expect to see instantons in this 
formalism. Instantons are localized, finite-action classical solutions of the 
field equations of QCD in Euclidean space-time. Physically they represent 
tunnelhng processes between topologically distinct vacuum sectors with the 
exponent of the instanton action being equal to the transition probability 
between two of these vacuum states. Although the notion of the instanton 
is intrinsically Euclidean, the tunnelling between different vacuum sectors 
can be formulated both in the Hamiltonian and the Lagrangian languages. 
In fact, the gauge projected variational approach we discuss here is very 
well suited for this purpose. The projection of the initial Gaussian onto 
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the gauge invariant subspace is achieved by the integration over the gauge 
group. The effective a-model arises as an integration over the relative gauge 
transformations between the two Gaussian states in the hnear superposition 
eq. (f7T|) . The Boltzmann factor exp(— r[f/]) for a given matrix U is therefore 
just the overlap of the initial and the gauge rotated state or, in other words, 
the transition amplitude between the two states^. The instanton transition 
is precisely a transition of this type, where the two states (at t —>■ — oo 
and t — i> +00) are related by a large gauge transformation. The matrix 
of this large gauge transformation must carry a non-zero topological charge 
Us{SU{N)). 

The integration measure over U indeed includes integration over topolog- 
ically non-trivial configurations. The finiteness of the action eq. (|83p requires 
that the matrix U approaches a constant value at infinity. This identifies all 
points at spatial infinity, hence the physical space of the model is 5*^. Field 
configurations are maps from 5*^ into the manifold of SU{N) and are clas- 
sified by their winding number, or topological charge, which is an element 
of the homotopy group Il3{SU{N)) = Z. The a-model action in a given 
topological sector is minimized on some configuration which is a solution 
of classical cr-model equations of motion. In particular, the solution with a 
unit topological charge is expected to have a "hedgehog" structure much like 
the topological soliton in the Skyrme modeip21- The integral over U in the 
steepest descent approximation is saturated by these classical solutions. 

These cr-model configurations that belong to a non-trivial topological sec- 
tor with a unit winding number represent QCD transitions between the topo- 
logically distinct sectors. The topologically non-trivial classical soliton solu- 
tions of the (T-model are therefore the three dimensional images of the QCD 
instantons. 

The QCD instantons are defined in space-time and are therefore four 
dimensional point-like objects. The cr-model solutions are intrinsically three 
dimensional. Nevertheless, there is a natural simple relation between the 
two. For a given Yang-Mills instanton solution A™'^*(x^) one can find a three 
dimensional SU{N) matrix f/(xj) by the procedure discussed by Atiyah and 
Manton gTj, 

Uam{x,) = Pexp (i j dx^'/^^A , (133) 



^Since the space of the matrices U is continuous, strictly speaking the Boltzmann factor 
is the differential rather than the total amplitude. 
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where the contour of integration C is a straight hne Xj = const, — oo < 
Xq < oo. The matrix Uam gives the relative gauge transformation between 
the initial trivial vacuum at xq — > — cxd and the topologically non-trivial 
vacuum at xq — >■ -|-cxd or, in other words, between the initial and final states 
of the instanton transition. Clearly, its meaning is precisely the same as 
that of the classical soliton solution of the effective a-model eq. (jH^ . Also, 
the QCD instanton action and the a-model soliton action have the same 
physical meaning. They both give the transition probability between different 
topological sectors in QCD. We will therefore refer to the cr-model solitons 
as instantons in the following. 

Although the QCD and the a-model instantons have the same physical 
meaning, it is not assured that the numerical value for their respective actions 
is the same. They both approximate the value of the transition probability in 
QCD, but the approximations involved are quite different. The QCD instan- 
ton action is the result of the standard WKB approximation which is valid at 
weak coupling and therefore for small instantons, but breaks down for instan- 
tons of large size. The a-model instanton action on the other hand is the value 
of this transition probability in a particular Gaussian variational approxima- 
tion. It is natural to expect that the variational calculation underestimates 
the value of the transition probability at very weak coupling. The transition 
probability is given by the overlap of the "ground state" wave functions in 
two topological sectors. For simplicity let us consider a quantum mechanical 
system with two vacua at x±. If the area below the barrier separating the 
vacua is large, the standard WKB instanton calculation is applicable. The 
wave function of each of the vacua below the barrier has essentially an expo- 
nential fall off exp{i J^ ^ E — V{x — x±)}. The instanton calculation is the 
calculation of the overlap of these functions. Our variational calculation cor- 
responds to approximating the respective "ground states" at x± by Gaussian 
wave functions. The tails of the Gaussians fall off much faster away from the 
minimum than the actual wave function and the overlap is therefore expected 
to be smaller. When the coupling constant is not too small (or when the area 
below the barrier is not too large) the overlap between the two states is no 
longer determined by the behaviour of the "tails" of the wave functions. In 
this situation one can expect the Gaussian approximation to do much better, 
since the overlap region contributes significantly to the energy and therefore 
plays an important role in the minimization procedure. 

In fact, implicitly, the instantons played a very important role already in 
the energy minimization described in this section. As we have seen above, 
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the energy is minimized for the value of the mass parameter M at which 
the cr-model is in the disordered phase. The transition between ordered and 
disordered phases in a statistical mechanical system can usually be described 
as a condensation of topological defects. This is a standard description of the 
phase transition in the Ising and XY models |48.. In the a-model eq. ()83|] 
the relevant topological defects are none other than the instantons. In this 
sense the appearance of the dynamical mass in the best variational state is 
itself driven by the condensation of instantons. 

Perhaps the most significant difference between the kinks in the Ising 
model and the QCD instantons, is that the former have a fixed size, while 
the latter come in a variety of sizes. This is a direct consequence of the 
dilatation symmetry of the classical Yang-Mills action. It is not necessary 
for instantons of all sizes to condense in order to drive the transition. The 
naive expectation therefore is that the large size instantons (larger than 1/M) 
condense, while the smaller ones should still exist as semi-classical solutions 
in the effective a-model action. 

The simple qualitative argument to this effect is the following. Consider 
the effective a-model action for very large size instantons. In such a configu- 
ration only field modes with small momentum k < M are present. For these 
momenta the action is the standard local cr-model where M plays the role of 
an ultraviolet cutoff 

r = ^^^tr ^ d,U^ix)d,Uix) . (134) 

If the large size instantons are stable at all, they should also be present 
as stable solutions in this local action eq. ()134|1 . However this is not the 
case as can be easily seen by the standard Derrick type scaling argument. 
Take an arbitrary configuration u{x) in the instanton sector and scale all the 
coordinates by a common factor A. Then obviously 

T[u{Xx)] = X-^T[u{x)] . (135) 

The dependence of the action on A is monotonic and is minimized at A — >■ oo. 
This means that the instantons in the local cr-model shrink to the ultraviolet 
cutoff 1/M. For instantons smaller than the inverse cutoff we cannot use the 
local action anymore. However, the behaviour of these small size instantons 
is already familiar. We know that classically they exist at arbitrary size, but 
that when the running of the coupling is taken into account, these instantons 
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are pushed to the large size. This is the famihar infrared problem of large 
instantons. In our variational state, the coupling constant stops running at 
the scale M. The picture is therefore very simple. The small size instantons 
are pushed to larger size by the effect of the coupling constant, while the 
large size instantons are pushed to smaller size by the effect of the local a- 
model scaling. We therefore expect that the instanton size will be stabilized 
somewhere in the vicinity of p ~ 1/M. 

The behaviour of the instantons in the variational ansatz eq. ()71|) was 
studied in detail in |1U]. The results are indeed very much in line with 
the expectations just outlined. The action of a small size instanton in the 
(T-model was found to be independent of its size (neglecting the running 
coupling effects), and numerically equal to 

r = 1.96^. (136) 

r 

This is about twice the value of the instanton action in QCD: Tinst = ^■ 
Thus, as expected, the tunnelling transition amplitude is underestimated in 
the Gaussian approximation for small instantons. Interestingly enough, how- 
ever, the actual configuration of the cr-model field that minimizes the action 
in the one instanton sector was found to be practically indistinguishable from 
the Atiyah-Manton expression calculated on the QCD instanton. This means 
that even though the value of the transition probability is underestimated in 
the Gaussian approximation, the actual field configurations into which the 
tunnelling is most probable are identified correctly — they are precisely the 
same as in the WKB calculation. 

As for the large size instantons, when the running of the coupling constant 
is taken into account their size is stabilized at about p = (1 — 1.5)/M. The 
uncertainty is to do with the way the running of the coupling constant is 
modified at k < M. It is in fact interesting to note that this instanton size 
is consistent with the average size of the instantons in the instanton liquid 
model of 1201 EH IS21 EBl Elj- For the case of SU{2), the average instanton 
size, in units of the gluon condensate obtained in the instanton liquid model, 
turns out to be [H ^IHI^ lHi]. 

p{{F^,F;,)a/nY^'^OA. (137) 

In our case, taking the value of the gluon condensate obtained in the varia- 
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tional approach, we find 

p((F^^F;,)a/7r)'/'~0.2-0.3. (138) 

The relation to the instanton liquid model is an interesting question which 
deserves further study. 

We also note that the variational Ansatz which has been considered so far 
corresponds to a zero value of the QCD ^-parameter, since we have integrated 
over the entire gauge group without any extra phases. As is well known, the 
general 6*- vacuum is defined as 



>= 



^e'"V> (139) 



where n labels the topological sectors in the configuration space (space of all 
potentials A°:{x)). Generalization of our trial wave functions to non-zero 
is trivial — all we need to do is to insert in eq. (|71|) an extra phase factor in 
the integrand 

exp U^ j dxe,,ktT [{U^diU){U^djU){U^dkU)] | . (140) 

The integrand here is a properly normalized topological charge, and it takes 
integer values for topologically non-trivial configurations U{x), i.e. this fac- 
tor reproduces the exp(in6') term in eq. ()139|) . This phase factor can be also 
obtained if one remembers that usually the 6'-dependence of the wave func- 
tional is given by the exp [iOScsi^)], where Scs{A) is a Chern-Simons term, 
which under the gauge transformation U transforms as 

ScsiA"") = Scs{A) + ^jdxe,,ktT [{U^d,U){U^d,U){U^dkU)] , (141) 

so that integrating over U leads precisely to the phase factor eq. ()140|) . The 
state thus constructed is an eigenstate of an operator of the large gauge 
transformation with eigenvalue e*^. This modification results in the addition 
of the same topological term to the effective action eq. (jHHjl . It is amusing 
to note that for 6 = tt, the "Skyrmions" in the effective theory will be 
"fermions" . 

While the extension of the variational calculation to non- vanishing ^-term 
is quite straightforward, it has not been performed thus far. 
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4.5 Confinement? 

The most interesting question is of course whether our variational state is 
confining. In the toy model in 2+1 dimensions discussed in the previous sec- 
tion, we were able to answer this question by calculating both the expectation 
value of the spacial Wilson loop, and the potential between static charges. 
Unfortunately, in the Yang-Mills theory the calculation is much more com- 
plicated and the answer is not known. Although there are some arguments 
that the state is indeed confining (see next section), it has not been proved 
or disproved by a direct calculation. The calculation of a potential between 
static charges d la jlO] has not been attempted. As for the calculation of the 
Wilson loop, some progress has been made in reducing this calculation to the 
cr-model level, but no final result has been obtained. 
The difficulty in the calculation of the Wilson loop 

W{C) = /tr Pexp (d I (ixiA^r" j \ (142) 

is to take into account the P-ordering of the exponent. One way of doing 
so is to introduce new degrees of freedom living on the contour C which, 
after quantization, become the SU{N) matrices r" [S3]. We briefiy describe 
the construction in the case of the SU{2) group — the generalization of this 
construction to an arbitrary Lie group has been discussed in f^j . 

The construction is based on the observation, made in [HEl ISZ| , that in- 
stead of considering the ordered product of r"" matrices one can consider the 
correlation function 

r"(ti)r^(t2) T%tk)\ , „,^, ,,^. ,,^ .. 



Dn{t)rf{ti)n\t2) . . .n\tk) 



exp 



i(^ + l/2) / d^ie^^e^'^^n'^d^n^d^rf 



(143) 



where S is the spin of the representation, i.e. for the fundamental represen- 
tation S = 1/2; n"(t) is a unit vector n'^n'^ = 1 living on a contour C {t is 
a coordinate on the contour); and S is an arbitrary two-dimensional surface 
with the boundary C = (5S. The two-dimensional action 

S[n]= I d^ie^^e'^^^n'^d^n^d^n^ (144) 
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depends only on values n"'{t) at the boundary. 

It can be shown that the Wilson loop can be rewritten as 



W{C) = { / Dn{t) exp 



E 



exp ig (p dxiA1{x{t))n''{t) ) (145) 



c 



The average over Ai can be performed using eqs. ()83l97p 



(exp ( ig (h dxiA'^{x{t))n''{t) \ ) = exp ( -ig (h dxia1{x{t))rf{t) 

•exp {-]- / / dt^dhx,{h)Uh)n\h)n\t2){M-^Y\x,y)] (146) 



c Jc 



where a" was defined in eq. (1^ . The Wilson loop can be calculated as the 
average over two scalar fields: U{x) living in the whole space and n°'{S,) living 
on a two-dimensional surface S such that C = 5E 



W{C) = I DU Dn exp {-T[U] + iS[n]) exp i-ig (b dx^ <(x(t))r2"(t) 
■exp ( -1 / / dhdt2X,ihMh)n''ih)n\t2)iM-'r\x,y)] . (147) 



c Jc 



In the infrared limit one can use eq. (|lU4p to simplify eq. (jl47|) and get 



W{C)= f DUl f Dnexp{-TL[U]+iS[n]) 
■exp(-i^-j dxi\'lL{x{t))n\t) 



■ exp I -- 0) (L dtidt2 Xi{ti)yi{t2)n''{ti)n''{t2)G{x - y) 
j DUHexp{-TH[U])exp (-z^ ^ dxAl H«t))S>''{t)\ . (148) 
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Integrating over Uh one obtains 
W{C) = fDnexp{iS[n]) 



2 -'c Jc 



■ exp -- i (b dtidt2X,{ti)yi{t2)n'' {11)71°- {t2)G{x - y) 



Dt/exp(-r[f/])exp 



- i dxiii{T''U^diU)n\t) 
2 Jc 



(149) 



where the integration DU is over the low momentum modes only and r[f/] 
is the corresponding low momentum action. Since G{x — y) is short range, 
the term 

exp (-^ i I dtidt2Xi{ti)yi{t2)n\ti)n\t2)G{x - yU (150) 

gives only perimeter dependence and can be neglected when calculating the 
string tension. Now reverting back from n" to t° we find 

W{C) = /trPexp f- / dkU^diU]\ (151) 

where the averaging is performed with the low momentum a-model action. 
This is reminiscent of the average of the monodromy operator 

M = trPexp ( / dk U^diUj (152) 

and one might expect that the result is similar. Since the target space of 
the sigma model is A1 = SU{N)/Z]^, and ni(A^) = Z^, the monodromy 
can take on values exp{i27m/N). It has a natural interpretation in terms 
of the topological defects in the sigma model. As mentioned above, the 
topology of the cr-model allows for the existence of Zj^ strings (the soliton- 
instantons discussed in the previous subsection do not play any special role in 
the monodromy calculation). The string creation operator and the operator 
M satisfy the commutation relations of the t'Hooft algebra (SHj- Therefore, in 
the presence of a string, the operator M has expectation value exp{i27Tn/N), 
where n is the linking number between the loop C and the string. As we 
have argued, the sigma model is in the disordered phase and the disordering 
can be thought of as the condensation of the topological defects. We have 
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thus far discussed skyrmions (instantons) as the relevant defects, but it is 
quite plausible that the Z^ strings are condensed as well. That would mean 
that the vacuum of the sigma model has a large number of strings and also 
that the fluctuations in this number are large. In this situation the VEV of 
A4 must average to zero very quickly, and for large loops will have an area 
law. We then may expect that the Wilson loop will also have an area law 
W{C)^exp{-a'A). 

While this argument is not implausible, currently we have no quantitative 
method of estimating eq. ()15H) . 

Even though the question about confining properties of the state remains 
unanswered, the results of the variational calculation so far are quite inter- 
esting. It yields the dynamical generation of the scale which is of the right 
magnitude, a reasonable value of the gluon condensate and a neat relation 
to the instanton physics. All these results are intrinsically non-perturbative. 

Apart from the vacuum structure, there is another mysterious domain 
of the QCD physics which is not accessible with perturbative tools: the 
deconfining phase transition. We may hope that the variational method 
can give us a handle to understanding the deconfinement physics. In the 
next section we describe its application to the Yang Mills theory at finite 
temperature and the study of the deconfinement phase transition. 

5 The Yang Mills theory at finite tempera- 
ture 

Attempts to understand the nature of the deconfining phase transition in 
QCD date back almost 30 years. Since the pioneering work of Polyakov ^U] 
and Susskind ^U^, much effort has been made to study the basic physics as 
well as the quantitative characteristics of the transition. The high tempera- 
ture phase of QCD is widely believed to resemble an almost free plasma of 
quarks and gluons. At asymptotically high temperatures this is confirmed by 
explicit perturbative calculations of the free energy jHI]. Perturbation theory 
in its simplest form, however, is valid only at unrealistically high tempera- 
tures. In recent years a different and promising avenue has been explored. 
This incorporates analytical resummation of the effects of the gluon screening 
mass into the 3D effective Lagrangian, which is then solved numerically by 
3D lattice gauge theory methods jn2l ESI EH EH] ■ The results of this approach 
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seem to be in agreement jHSl with direct 4D lattice gauge theory calculations 
[HZl IHHI all the way down to 2Tc. Although we are quite advanced in the 
understanding of the high temperature phase, the transition region itself is 
very poorly understood. This region of temperatures, Tc < T < 2Tc, is of 
course the most interesting one, since it is in this region that the transition 
between "hadronic" and "partonic" degrees of freedom occurs. Interestingly 
enough, the numerical results indicate that although asymptotically the free 
energy does approach that of the free partonic plasma, the deviations from 
the Stefan-Boltzmann law even at temperatures of order lOTc are quite siz- 
able, of order of 15%. This is an indication that the interesting physics of 
the transition region remains important even at these high temperatures. 
The study of the transition region itself is a complicated and inherently non- 
perturbative problem. 

The purpose of this section is to study the deconfining phase transition 
in a pure SU{N) Yang-Mills theory using the variational approach described 
in the previous sections suitably extended to finite temperature [02] ■ We 
will minimize the relevant thermodynamic potential at finite temperature, 
i.e. the Helmholtz free energy, on a set of gauge invariant density matrices. 

5.1 The variational Ansatz for the density matrix 

The equilibrium state of a quantum mechanical system at finite temperature 
is not a pure state, but is described by a mixed density matrix. Thus in 
order to extend the variational analysis to finite temperature we have to 
generalize our ansatz eq. (f7T|) so that it includes mixed states. In scalar 
theories the Gaussian approximation has a long history of applications at 
finite temperature jTOJI^. We generalize our Ansatz along the same lines. 

We start by considering the density matrices which in the field basis have 
Gaussian matrix elements 



~g[A, A'] = exp I - i ^ At{x)GT^'^\x, y)A%y) 

+ A'nx)GT^"^\x,y)A'^\y) - 2A^(x)i7lf (x,y)A;^(y)| . (153) 
As before, we take the variational functions diagonal in both colour and 
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Lorentz indices, and translationally invariant 



GT^'^'ix.y) = 5^%,G-\x-y), (154) 

H^;'ix,y) = 5''%Hix-y). 

Then 

g[A, A'] = exp I - - /" {AG'^A + A'G'^A' - 2AHA') \ . (155) 

For H = this density matrix represents a pure state, since it can be written 
in the form 

^= |^[A] ><^[A]| (156) 

with "^[A] a Gaussian wave function, eq. (j7(jp . At non-zero H the density 
matrix is, however, mixed. The magnitude of H, therefore, determines the 
entropy of this trial density matrix. 

We now make an additional simplification in our ansatz. First, we restrict 
the functions G~^{x) to the same functional form as at zero temperature 
eq. dZHD, i.e. 

^ ^^) = 1m ifA;^<M^ • ^^^^) 

Further, we will take H{k) to be small and non-vanishing only at low mo- 
menta 

^^^^~\ i7«M iik'<M' ■ ^^^^^ 

The logic behind this choice of Ansatz is the following. At finite tem- 
perature we expect H{k) to be roughly proportional to the Bolzmann factor 
exp{ — E(A;)/3}. In our ansatz, the role of one particle energy is played by 
the variational function G~^{k). We will be interested only in temperatures 
close to the phase transition, and those we anticipate to be small, Tc < M. 
For those temperatures, one particle modes with momenta k > M are not 
populated, and we thus can put H{k) = 0. For k < M the Bolzmann factor 
is non-vanishing, but small. Further, it depends only very weakly on the 
value of the momentum. We will have, of course, to verify a posteriori that 
our assumptions about the smallness of Tc and H are justified. 
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As before, we explicitly impose gauge invariance by projecting q onto the 
gauge invariant sector 

QiA.A] = f DU'DWexp I - - / A^'G^'A^' 

+ A'''"g-'a"'"-2A^'HA"'"\, (159) 



where A'^ is given by eqs. ()72l73p . One of the group integrations in eq. 

is redundant, since we will only calculate the quantities of the form TtqO, 

with O being gauge invariant. Our Ansatz for the density matrix then is 

g[A,A']= f DUexpl -- f AG-^A + A'^G~^A'^ -2AHA'^\. (160) 

This expression is not explicitly normalized to unity. Nevertheless, we find 
it convenient to refer to it as density matrix while explicitly inserting a nor- 
malization factor whenever necessary. Thus the average of a gauge invariant 
operator O is given by 

{0)a,u = Z-'Ti{gO) 

= Z-^ f DUDAO{A,A') 

• exp I - i f AG-'A + A'^G-'A'"^ - 2AHA''^ 

where Z is the normalization of the trial density matrix g, i.e. 

Z = TTg= f DUD A exp I - - /" AG'^A + A^G'^A^ - 2AHA^ 

= f DUDAexpl-- f AAA + X{G^^ ~ujA-^uj^)x\ (162) 



(161) 



A'=A 



with 



A = A + \uA-\ (163) 

A = 2^-^(1-^(5 + 5^)), (164) 

00 = (G-^S-H). (165) 
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The A integration can be performed to yield 

Tr^ = f DUexpl - -\ (G^^ - luA-^lu'^) A - -Tr In — j . (166) 

We now adopt the same strategy for treating the high momentum modes 
of f/ as at T = 0. Namely, they are integrated perturbatively to one loop 
accuracy. The result is the effective a-model for the matrices U with mo- 
menta below M. The coupling constant g of this cr-model gets renormalized 
as before according to the one loop Yang-Mills /5-function, and thus has to 
be understood as g{M). Additionally, due to independence of H on momen- 
tum, for low momentum modes of U the function H{x — y) is equivalent to 
H6^{x — y). 

The final approximation has to do with the fact that H is assumed to 
be small. For arbitrarily large H the variational calculation is forbiddingly 
complicated even with all the above mentioned simplifications. This is be- 
cause the gauge projection renders the calculation of entropy in the general 
case unfeasible. However, at small H we only need to calculate the leading 
term in entropy. This calculation can indeed be done, and is described in the 
following. Since we are only calculating the leading order contribution in H, 
we only have to consider corrections to the a-model action of first order in 
H . With this in mind, the normalization factor becomes 

TrQ = j DUexv{-\^{^ + ^{S + S^))\^\HGii{S + S^)\. (167) 

5.2 The effective cr-model 

Just like at zero temperature, the normalization Z can be interpreted as the 
generating functional for a theory defined by the action S{U) 

Z = Tr0= f DUe~^^^\ (168) 

where 

S(U) = — AA + -A H(S + S^)\- -^HMhrS . (169) 

^^48 47r2 ^ ^ 
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We simplify this expression using 

AA = ^tiidUdU^), (170) 

XS^X = XSX = — \rtT\(U^dU - dU^UMdUU^ - UdU^)] , (171) 
2g^ L J 

tr5 = tiS^ = tiUhiU - 1 . (172) 

Inserting these into the action we get 

S{U) = ^ti{dUdU^) - ^ii\{U^dU - dU^U){dUU^ - UdU^) 



2^ 



^-HMhiUhiU, (173) 
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where t/-independent terms have been dropped. 

At this point it is useful to relate our effective a-model with a standard 
tool used in finite temperature calculations, namely the effective action for 
the Polyakov loop. The matrix U plays a similar role to the Polyakov loop 
P at finite temperature — the functional integration over U projects out the 
physical subspace of the large Hilbert space on which the Hamiltonian of 
gluodjTiamics is defined. The effective a-model eq. ()173|) therefore is a close 
analogue of the effective theory for the low momentum modes of the Polyakov 
loop variable. Its status and applicability region are however different from 
the usual perturbative effective actions, see e.g. [72]. The standard effective 
action is calculated in perturbation theory and is valid at high temperature. 
Our effective action eq. p73p depends on the variational parameters M and 
if, and in a sense is a variational effective action. Also due to our restrictions 
to small values of if, a priori we do not expect it to be valid at high tem- 
peratures but, rather, it should represent correctly the physics in the phase 
transition region. 

Another important difference is that our effective a-model does not have 
the local gauge invariance U{x) -^ V'^ {x)U {x)V [x) which is usually asso- 
ciated with the effective action for the Polyakov loop. The reason for this 
is that our setup is different from that of the standard finite temperature 
calculation. The way this gauge invariance usually appears is the following. 
Consider the calculation of any gauge invariant observable in the equilibrium 
density matrix at finite temperature 

{O) = jDUTT[exp{-pH}OgiU)] , (174) 
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where g{U) is the second quantized operator of the gauge transformation 
represented by the matrix U. This expression for fixed U can be compared 
to the same expression but with U gauge transformed 

Tr[exp{-(3H}OgiV^UV)] = TT[exp{-pH}OgiV^)giU)giV)] 

= TT[exp{-pH}OgiU)] . (175) 

The last equahty here follows from the fact that both O and exp{/3H} are 
gauge invariant, and thus the operator g{V^) can be commuted all the way to 
the left. The only effect of the transformation is then to change the basis over 
which the trace is being taken, which obviously leaves the trace invariant. 

Our variational setup is somewhat different. Expectation values are cal- 
culated as 

f DUTT[gg{U)0] (176) 

with g defined in eq. (|155j) . This expression is altogether gauge invariant, 
since the integral over U correctly projects only the contribution of gauge 
singlet states. However the operator g is not itself explicitly gauge invari- 
ant. For that reason the gauge transformation operator giV^) cannot be 
commuted through it, and thus 

Tr[gOgiV^UV)] ^ Ti\gOg{U)\ (177) 

even for gauge invariant operators O. This manifests itself as absence of local 
gauge invariance in the action of the effective a- model, eq. (|173j) . 

Nevertheless, we stress again that since the integration over the SU{N) 
valued field U projects out the physical Hilbert space, its meaning in this 
sense is the same as that of the Polyakov loop. 

5.3 The Calculation of the free energy 

To find the best variational density matrix we have to minimize the free 
energy with respect to the variational parameters M and H. The Helmholtz 
free energy F of the density matrix g is given by 

F=(H)-TS, (178) 

where H is the standard Yang-Mills Hamiltonian eq. ()67p . S is the entropy, 
and T is the temperature. 
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Thus 

F = - (riiE^g) + Tt{B^0)^ + T ■ Tr(f? In g) . (179) 

First of all we need to perform the integration over the gauge fields, and 
reduce this expression to the average of a fZ-dependent operator in the effec- 
tive (T-model. In fact, as we shall see soon, to leading order in H the only 
non-trivial calculation we need to perform is that of the entropy. 

We will calculate the entropy up to the first non-trivial order in H. As 
we now show, the leading term at small H is 0{H In H). 

Let us denote by qq the density matrix of the pure state with H = 0: 

f?o = |0)(0|. (180) 

Here |0) does not denote necessarily the actual ground state, but rather a 
projected Gaussian state with arbitrary M. Now, since the matrix elements 
of the density matrix can be expanded in powers of H, to leading order we 
can write 

g = go + 5g, (181) 

where 6g is 0{H). 

Imagine that we have diagonalized g. It will have one large eigenvalue 
ao = 1 — 0{H), which corresponds to the eigenstate 

|0') = |0) + 0(/J) . (182) 

All the rest of the eigenvalues ai are at most 0{H). Then the entropy can 
be written as 

oo 

S = — Tr(^ In g) = —uq In uq — 2^ cut In ctj • (183) 

The second term is 0{H In H), and it is the coefficient of this term that 
we will now calculate. Neglecting 0{H) corrections, we can substitute a, = 
H/M under the logarithm. Thus to leading logarithmic order 

S = -^ai\nH/M. (184) 

i 

Thus we have to calculate Xli*^*- L^t 

|0') = |0) + iJ|a;) . (185) 
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Then 

oo 

^ = ao|0')(0'| + J]ai|xi)(xi| (186) 

with 

{xi\0') = 0. (187) 

Note that {0\x) ^ 0, but 

(0|x) + (a;|0) = 0, (188) 

since |0') has to be normahzed at 0{H). Also 

{xi\0) + H{xi\x)=0. (189) 

Thus the overlap (xj|0) is 0{H), and we have 

g = ao|0)(0| + h(\0){x\ + \x){0\] + ai\xi){xi\ . (190) 

Multiplying this by Qq we get 

00-0 = ao-go + H\0){x\+H{0\x)\0){0\, (191) 

^■^0 = ao-^o + i^|a;)(0|+/f(x|0)|0)(0|. 

Thus, 

Qo- Q + Q- Qo - Q = ao- Qo - ai\xi){xi\. (192) 

Multiplying again by Qq, we get rid of |xj)(xj| to 0{H) 

ao- Qo = Qq- Q + Qq- Q- Qq- Qo- Q = Qo- Q- Qo. (193) 

Then, 

«o = Tr(^o ■ Q) . (194) 

Since Tr^) = 1 we have 

^cti = 1 -ao = Tr(£io(l- ^)) (195) 



which, inserted into eq. ()183p . gives 

S = -(l-Tr(^o-^))lni^/M. (196) 
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The derivation has been given for the normahzed density matrices ^o and 
Q. In terms of our Gaussian matrices we should restore the normahzation 
factors Z and Zq, so that finally we have 



1 ) In H/M . 



Tr(go ■ g) 
.Tr^o -Trg 

It is easy to check that to 0{H) 

Tr(go ■ p) = (Tr^o)^ 
and 

From eq. (|167p . it is clear that 



Tig 



l + H{-^MhrS--XS\ 

4:71^ 4 



■ Tr^o ■ 



(197) 

(198) 
(199) 

(200) 



Using eqs. p71ll72|) we finally get 



S = - 



8^2 



ti{WdU - dU^U){dUU^ - UdU^) 
1 



+ 
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M2(trf/1'trf/-i; 



u 



H In H/M. (201) 



To leading order in H, the averaging over U in this expression has to be 
performed with the a-model action with H = 0. 

The expression of eq. (j201|) has the following striking property. For M < 
Mc it vanishes identically. The reason is very simple. The first term in 
eq. ()201|) is the product of the left handed SU{N) current and the right 
handed SU{N) current in the cr-model. Thus it transforms as an adjoint 
representation under each one of the SU{N) factors of the SUl{N) ^ SUr^N) 
transformation. The same is also true for the second term in eq. ()20H) . 
The cr-model action at if = is itself obviously invariant under the whole 
SUl{N) (g) SUji{N) group. Now, at M < Mc, the symmetry group is not 
spontaneously broken, and thus any operator which is not a scalar has a 
vanishing expectation value. It follows immediately that the entropy has an 
0{H\nH/M) contribution only for M > M^, when the SULiN) ® SUr{N) 
group is spontaneously broken down to SUv{N). 
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This observation makes our task considerably simpler. Since for M < M^ 
the entropy is zero, we do not have to consider at all the disordered phase 
of the effective a-model. In this disordered phase the free energy coincides 
with energy, and thus the calculation is identical to the calculation at zero 
temperature presented in sec. (JD). 

Thus we only need to consider the effective a-model in the ordered phase. 
As at T = we perform the calculations in the ordered phase to leading order 
in as- Since there are no 0{H In H/M) corrections to energy at this order, 
the result for the energy in the disordered phase is identical to the result at 
zero temperature, eq. (jllSj) . Thus our expression for the free energy in the 
ordered phase of the cr-model is 

iV2 



1207r- 



-M^ + j((^ti(U^dU - dWU)(dUW - UdW) 

+ ^M^{tTUhTU-l)\ jHlnH/M. (202) 
We now average over U in the leading order perturbation theory. 

5.4 The cr-model perturbation theory 

For the purpose of the perturbative a-model calculation we parameterize the 
U matrices as 

t/ = exp{^#V"}. (203) 

Although we only need the leading order, it is instructive to check that 
the order g^ term in the expansion is indeed small. To this order we have 

U- (l + -g(P''T'' - -^20a0VV^ - — ^3^'^^VrW") . (204) 

V 2 8 48 / 

So that the a-model action becomes 

M 
S = tiidUdU^) 

= —d(j)d^ + —g\d(l)''){d(l)'')(p''(phi [rVVV'^ - rWV'^j . (205) 
4 192 L J 

The propagator of the phase field is thus 

2 

MA;2 



(</'>') = 7777^"'- (206) 
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To get the idea of the quahty of this perturbative expansion we can calculate 
for example (S). In this calculation one has to take into account the fact 
that the measure in the path integral over the phase 0" is not the simple D(j), 
but rather the group invariant U{N) measure fi . To first order in g"^ it is 

/. = Drexp{^g'j£xcp'ix)] . (207) 

Taking this into account we find that (S) gets no correction of order g"^. We 
thus feel confident that the use of the perturbation theory in the ordered 
phase of the a-model is an admissible approximation. In the following we 
will only keep leading order expressions. 

Calculating to leading order the entropy eq. ()201|) and keeping only the 
0{N'^) terms we find 

{S) = -^^M^H\n^ (208) 

Introducing the dimensionless quantity 

h = I (209) 

we can write the expression for the free energy as 

F = (H)-T(S) 

-M^ + J^M^h\ah. (210) 
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We now have to minimize this expression with respect to h and M. It is 

convenient to first perform the minimization with respect to h at fixed M. 

This obviously gives 

9F 1 

Thus as a function of M only, the free energy becomes 

Ar2 T N'^ 

M^--—M\ (212) 
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Now minimizing with respect to M we find 



df 15T , , 

= 0^M= . 213 

dM e ^ ' 
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Thus for M > M^ the free energy of the best variational density matrix 
as a function of temperature is 



A^^ /15T\4 



M>Mc 



3607r2 



[—) ■ P") 



We now have to compare this value with the free energy for M < Mc- 
As we have discussed above, this is given by the expectation value of the 
Hamiltonian alone, and is minimized at M = Mc. Its value is 



N' 



2 



M<Mc 
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Comparing the two expressions we find 



M^. (215) 



Tc = 


15^- 


Using the value of M^. from eq. p21 


H) we have 



(216) 



Tc = 450Mev. (217) 

For T < Tc the free energy is minimized in the variational state with 
M = Mf.. In our approximation this state is the same as at zero temperature. 
Its entropy vanishes, and the effective a-model is in the disordered phase. The 
Polyakov loop vanishes, ([/) = and according to the standard wisdom this 
is a confining state. 

For T > Tc the best variational state is very different. The entropy of 
this state is non-zero, 

= = 6a(— ) ■ Pi«' 

The Polyakov loop is non-zero (f/) 7^ and thus the high temperature density 
matrix describes a deconfined phase. 

Finally, we note that in the deconfined phase our best variational density 
matrix has a non- vanishing "electric screening" or "Debye" mass. The Debye 
mass is conveniently defined as the "mass" of the phase of the Polyakov loop. 
This mass is non-vanishing in our calculation for the following reason. As 
long as if = 0, the effective a-model action has a global SUl{N) (g) SUr{N) 
symmetry. Thus in the ordered phase of the a model the phases 0'* are 
massless. However, as discussed above, the terms of order H in eq. (jl73|) 
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break this symnietry explicitly down to the diagonal SUv{N). As a result 
the would be "Goldstone" phases 0" acquire mass. To calculate this mass it 
is convenient first to note that to 0{g'^) 

tT{U^dU - dU^U){dUU^ - UdU^) 

= -4:ii{dU^dU) - — 0'^0^(90^90'^tr(rVVV^ - rVVV"^) . (219) 

The contribution of the SUl{N) SUr{N) term to the mass cancels against 
the contribution of the measure eq. ()207|) . Using eqs. ()173U219|) we then find 
to 0{g'^) and to leading order in H 

Ml = —as{M)NMH . (220) 

As a function of temperature we have 

Ml = aA)N^-^j\ (221) 

Let us summarize the results of our analysis of the deconfinement transi- 
tion. We find the phase transition at a temperature of about T^ ~ 450 Mev. 
The transition is strongly first order at large N . The latent heat is AE = 
g^ (i|^) . Below the transition the entropy is zero, the best variational state 
is the same as at zero temperature, and the average value of the Polyakov 
loop is zero. Above the transition, the entropy is non-zero and proportional 
to the number of "coloured" degrees of freedom, S oc N'^. The average value 
of the Polyakov loop is non-zero and the phase is deconfined. 

It is quite interesting that at high temperature our formulae numerically 
are quite close to the predictions of free gluon plasma. In particular, our 
value for the free energy, eq. ()214|) . should be compared to the free gluon 
plasma expression 

ffree = -^T\ (222) 



The ratio between the two is 



free 



0.85. (223) 



The ratio of the entropies is the same. 

59 



Interestingly we get the same ratio comparing our value for the Debye 
mass eg . (12211) with the leading order perturbative one, M^^^^ = ^agNT"^, 

— ^ - 0.85 . 224 

Ml > 

The pressure approaches its asymptotic value according to the simple 
formula 

P(T) , T; 



c 



P T4 

' asympt ' 



(225) 



Here the asymptotic value of the pressure Pasympt is given by eq. ()214|) . The 
pressure P(T) is given by the difference between eq. ()214|) and the value of the 
free energy at zero temperature, which coincides with expression eq. ()215p . 

One has to take the comparison eqs. ()223f224|l with a grain of salt. As 
explained above, our calculations were performed assuming small H. A priori 
we expect that this restriction should confine us to not too large tempera- 
tures. On the other hand the minimization of the free energy resulted in 
the value H/M = 1/e independently of temperature. Thus, we feel that the 
comparison eq. ()223j) may be meaningful. 

The main features of these results are indeed what we expect from the 
deconfinement phase transition on general grounds. It is nice that a simple 
minded calculation such as this does quahtatively so well in such a compli- 
cated problem. It therefore appears that the projection of the trial density 
matrix on the gauge invariant Hilbert space is, just like at zero tempera- 
ture, the crucial feature that dictates most if not all the important aspects 
of the low energy and low temperature physics. In the context of the present 
calculation the most important effect of the gauge projection is obviously 
vanishing of the entropy in the low temperature phase. We stress that this 
feature was not at all built into our initial ansatz, but followed naturally and 
unavoidably in the disordered phase of the effective cr-model. 

Quantitatively, this calculation of course should be taken for what it is 
— an approximate implementation of the variational principle. As with 
any variational calculation, the range of validity of this calculation is not 
sharply defined. Even within the variational framework we had to resort 
to additional approximations. The most severe simplifications that we had 
to impose are the perturbation theory in the ordered phase of the a-model 
and the assumption of smallness of H. The projection over the gauge group. 
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which as we saw is so physically important, is what makes the calculational 
task difficult and forces us to make these approximations. 

The assumptions of smallness of g and of smallness of H affect different 
aspects of our result. In particular, in the leading order of the perturbation 
theory the expectation value of the Polyakov loop U is equal to unity. The 
actual value of U on the ordered side of the transition according to jT^I is 
close to one half. Thus our perturbative calculation is rather more reliable 
somewhat further away from the transition. The closer to the transition 
we get, the more important higher order corrections in g"^ become. Thus 
to properly describe the transition region of QCD we need to improve our 
calculational method in the vicinity of the transition in the cr-model. In 
line with this we expect that the estimate for the critical temperature we 
obtained here is somewhat higher than we would get, had we treated the 
cr-model more accurately in the transition region. This is consistent with the 
fact that our result for Tc is by about 50% higher than the lattice value of 
270 Mev. 

The smallness of h is quite important in a different way. The value of 
h = 1/e that we obtain is in fact a reasonably small number, so omitting 
the corrections in powers of h is fairly safe. On the other hand, the terms 
linear in h but not enhanced by Inh, which we have ignored in the present 
calculation, have to be accounted for more carefully. With the value of h 
that we obtain, these terms are not suppressed in any obvious way. 

The obvious stumbling block to any improvement along these lines is the 
calculation of the entropy S = — tr^ In g. However, if one opts for restricting, 
as before, the analysis to leading order in g^ the entropy and, therefore, the 
free energy can be, without any additional approximations, calculated to all 
orders in h [74J. This improved analysis is carried out in the following. 

5.5 All-order in h analysis 

Let us ask ourselves what would happen if we did not restrict H to be small, 
and more generally did not restrict the functional forms of G{k) and H{k) 
in our variational ansatz. We could still carry on our calculation for a while. 
Namely we would be able to integrate over the vector potentials in all aver- 
ages, and would reduce the calculation to a consideration of some non-linear 
(j-model of the [/-field. This a-model quite generally will have a symmetry 
breaking phase transition as the variational functions G{k) and H{k) are 
varied. Since at this transition the Polyakov loop U changes its behaviour, 
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the disordered phase of the cr-model corresponds to the confining phase of 
the Yang Mills theory, while the ordered phase of the cr-model represents the 
deconfined phase. Thus, in order to study deconfinement in the SU{N) Yang 
Mills theory, we should analyze the physics of each a-model phase as accu- 
rately as possible and calculate the transition scale Mc (or rather Gc{k)). We 
then calculate the free energy of the a-model in each phase at temperature 
T and extract the minimal free energy. The deconfinement transition occurs 
at the temperature for which the free energies calculated in the ordered and 
disordered phases of the sigma model coincide. 

In practice in the disordered phase no progress seems possible without 
restricting the arbitrary kernels and we adopt the forms eqs. (|157I158|) . The 
resulting minimal free energy is thus independent of the temperature and is 
given by eq. ()215|) . 

On the other hand, in the ordered phase we can relax the restriction on 
H and G if, as before we work in the leading order in perturbation theory. 
In this case minimization with respect to arbitrary kernels G~^{k) and H(k) 
is possible. We now describe this calculation following |73]. 

In this approximation, for the U matrices we use the parameterization 
eq. ()203|) . Hence at leading order one can take 

d,U - zgd.rY- ^^^^^ 

Thus, the gauge transformations eq. (|7^ reduce to 

A1 ^A^- 84" (227) 

and the Hamiltonian eq. (p7j) reduces to 

H = i [Ef + {e,,ud,Alf] . (228) 

These last two equations describe the theory t/(l)^ ~^: in the leading order 
of the a - model perturbation theory, the SU{N) Yang-Mills theory reduces 
to the f/(l)^ "^ free theory. The density matrix eq. p60p becomes Gaussian 
again, because the gauge transformations are linear. One has 

g[A, A'] = f D(j) expl -^ IaG'^A + {A' - d(f))G-\A' - 90) 

- 2AH{A' - 90)1 1 . (229) 
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Now the theory of A^^ — 1 U{1) free fields in 3 + 1 dimensions is completely 
tractable. Both the energy and the entropy can be calculated explicitly [7^] . 
The free energy in terms of the arbitrary kernels G~^ and H is 



N^ 



(2^ 



^-^(1 + GH)+p^G{l - GHY^ 
GH 



- 4T In 



i 



In 



7] 






where 77 = 1 - (1 - {GHffl^ and ^ = (1 - {GHff/^ 
It is minimized by 



GH 

-{l-GH). 



(230) 



G-i 



P 



H = 2p 



1 + e-^ 

1 ^^ 
1 — e T 

e T 

TIF 
— e T 



(231) 



and the minimal value of the free energy at temperature T is 

_ Ar2-1 



7r^ 



p 



p dp — h Tln(l — e 



^p/T^ 



(iV2 _ i)T4 /•- , a;3 



37r2 
7r2(Ar2 - 1)T 



e^ 



45 



(232) 



where, just like in eq. (I215|l . zero-point term has been discarded. This is, of 
course, just the free energy of a free photon gas. 

Thus the free energy of SU{N) Yang Mills theory is minimized with 
M = Mc in the disordered phase of the a- model for temperatures below T^, 
which is obtained by equating the free energies eqs. ()215|) and 



which yields 



T. 



' 307r2 



SV^^M, 



2Ar2x4 



45" 



-^ ^ 470 MeV . 



(233) 



(234) 
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The all order in h improvement discussed here allows us to take more 
seriously our results at high temperature. At high temperatures the kernel, 
which corresponds to the Boltzmann factor, is of order unity and thus our 
original assumption of smallness of H. This is indeed obvious from eq. ()231|) . 
As a result we now reproduce the expected asymptotic free gluon plasma 
result for the free energy. 

On the other hand this improvement affected very little our previous 
results in the transition region. The transition temperature is shifted only 
by about 5%. The same is true for the the value of the parameter H at low 
momentum. As before, we find that the deconfinement phase transition is 
strongly first order with latent heat /S.E = ^^M^T^. 

Although the actual value of the transition temperature is considerably 
larger than the lattice estimate, as explained earlier it makes more sense 
to look at dimensionless quantities. In particular, if we identify 2Mc with 
the mass of the lightest glueball (see however J42j and the discussion in the 
previous section), we find 

This is in excellent agreement with the lattice estimate for SU{3) pure gauge 
theory jTH]. We should however caution that given the uncertainties in our 
calculation this agreement may well be fortuitous. 



6 Conclusions 

We have tried here to critically review the application of the variational 
principle to Quantum Field Theories with gauge invariance, with the main 
focus on the approach developed by Ian Kogan and collaborators P^I29|KH 

Eg ESI- 

Although it is too early to decide whether this approach can be a use- 
ful calculational scheme for strongly interacting gauge theories, we can draw 
encouragement from its performance in the non-trivial toy models. In par- 
ticular, in compact QED in 2 + 1 dimensions, we have been able to reproduce 
all known non-trivial characteristics of the non-perturbative vacuum state: 
dynamical mass generation, confining potential between external charges and 
area law behaviour of the spatial Wilson loop with parameterically correct 
values for the string tension and mass. Although this is the only example that 
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we have covered extensively in this review, the method has also been applied 
to other lower dimensional systems, and it works very well in all cases. Thus 
the deconfining phase transition in 2+1 compact QED at finite temperature 
is described correctly 021 • In the (exactly solvable) Schwinger model the vari- 
ational approach reproduces the exact ground state wave functional [77j. In 
the compact 2+1 QED with Chern-Simons term [7H] it predicts a Kosterlitz- 
Thouless phase transition in the value of the Chern-Simons parameter, in 
agreement with earlier analysis [7^. 

In 3+1 dimensional gluodynamics, this variational method gives results 
which on the qualitative level at least, conform with our intuition about 
the structure of the ground state, both at zero and finite temperature. We 
find dynamical mass generation, corresponding to an acceptable value of the 
gluon condensate. At finite temperature we find a first order phase transi- 
tion which corresponds to the Polyakov loop acquiring a non-zero average. 
Although we have not calculated the string tension directly, the behaviour of 
the Polyakov loop is very much indicative that this is indeed the deconfining 
phase transition. The value of the critical temperature (in units of glueball 
mass) we find is in good agreement with lattice results. We also found that 
in the low temperature phase the entropy remains zero all the way up to 
the transition temperature. This is a rather striking result, which has not 
been built into our variational ansatz, but rather emerged as the result of 
the dynamical calculation. 

An important lesson we learned from the lower dimensional models is 
that the projection of the Gaussian trial state onto the gauge invariant 
Hilbert subspace dictates most, if not all, of the important aspects of the 
non-perturbative physics. It was absolutely essential to perform the pro- 
jection non-perturbatively, fully taking into account the contribution of the 
overlap between gauge rotated Gaussians into the variational energy prior to 
minimization. 

The same conclusion carries over to the pure Yang Mills theory. We 
have seen that from the point of view of the effective cr-model the energy 
is minimized in the disordered phase. In other words, the low momentum 
fluctuations of the fleld U are large, unlike in the perturbative regime, where 
U is close to a unit matrix. From the point of view of the trial wave functional, 
this means that the off-diagonal contributions, coming from the Gaussian 
WF gauge rotated by a slowly varying gauge transformation, are large. It is 
these "off diagonal" contributions to the energy that lowered the energy of the 
best trial state below the perturbative value. In the low temperature phase 
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the vanishing of the entropy was also a direct consequence of the effective 
cr-model being in the disordered phase, and thus of the non-perturbative 
nature of the gauge projection. The accounting for these off diagonal terms 
non-perturbatively is the main distinction between this approach and other 
attempts [211 HZl I2H1 1211 I2H1 121| to implement the variational principle in 
gauge theories. 

Many outstanding questions remain. Is the best variational state confin- 
ing? How do we calculate the interaction potential between external sources? 
How do we understand better the relation between the variational parameter 
and the glueball masses? Can we extend the Ansatz to include (massless) 
fermions? 

Both to be confident in our results and to be able to approach these ques- 
tions we need first and foremost to have a better way of treating analytically 
the effective non-linear a-model. The use of the mean field approximation in 
the effective a-model was the main source of uncertainties in our calculations 
both at zero and finite temperature. We believe that it should be possible 
to treat the a-model in a better way, perhaps along the lines of a continuum 
version of [^. Such an improvement is crucial to clarify whether the qualita- 
tively appeahng results that we have described here are a kind of fluke due to 
an interplay of two bad approximations (variational and mean field) or are 
genuine predictions of a useful, workable variational approach. Personally 
we do believe that these results are genuine and that there is enough scope 
for further development of the approach which warrants continuing active 
investigations. 
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